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EDITORIAL 


DNA patents revisited 


bipartisan proposal to modify restraints in pat- 

ent law was debated before the United States 

Senate Judiciary Committee this month. Imme- 

diately following the third hearing last week, 
leading scientists called for a study of the rel- 

evant issues by the U.S. National Academies of 
Sciences, Engineering, and Medicine, prior to the 

draft legislation moving forward (www.patenteligibility. 
com). This study should be initiated soon, with a short 
timeline, to ensure that appropriate information and 
analysis can inform the legislative processes that will 
substantially affect how the 
benefits of science and inno- 
vation are delivered to society. 
Patents balance providing 
incentives to take the finan- 
cial risks necessary to con- 
vert an invention into useful 
products with the benefits of 
sharing information to drive 
other useful inventions. Since 
the advent of DNA sequencing 
and gene identification meth- 
ods, patenting human genes 


been deemed patentable in their purified forms because of 
the enhancement of their activities through purification. 
Natural products have been quite important as drugs or 
in driving drug discovery, such as the immunosuppressive 
compounds cyclosporine and tacrolimus (FK506), which 
enabled solid organ transplantation. 

Isolation of natural substances also enables “composi- 
tion of matter” patents. These are valuable because they 
cover the material regardless of its use or the processes 
in which they are components. The Myriad patents in- 
volved the composition of matter of the BRCAI/2 genes. 
The Supreme Court ruled that 
these substances were not eli- 
gible as exceptions from the 
product of nature exclusion 
because isolation of the BRCA 
genes from their chromo- 
somal environment did not 
transform their character suf- 
ficiently. Moreover, the item of 
value was information in the 
gene sequences rather than 
the materials themselves. 
However, the language in the 


has been controversial. A no- 
table example involves pat- 
ents for the genes BRCAI and 
BRCA2, variations in which 
modulate risks for breast and 
ovarian cancer. These patents 
supported increased costs and 
hence, limited accessibility, 
for diagnostic tests for cancer 
patients and their families. 
In 2013, the U.S. Supreme Court ruled that these pat- 
ents were invalid (Association for Molecular Pathology v. 
Myriad Genetics). The Myriad decision shifted the land- 
scape not only for gene patents but also for patents on 
other “products of nature.” The current bill is directed 
toward clarifying the concepts and language underlying 
the issues involved in patenting products of nature. It is 
essential that these issues be examined thoroughly and 
thoughtfully prior to enshrining language in law that 
could easily have unintended consequences. 

To avoid overly restricting research, there are certain ex- 
clusions from patentability, namely for abstract ideas, nat- 
ural laws, and products of nature. However, for products 
of nature, exceptions have been made for identifying and 
isolating substances whose properties differ substantially 
from the source from which they were obtained. For more 
than a century, natural products, such as adrenaline, have 


“patenting human 
genes has been controversial.” 


Court opinions was subject 
to different interpretations. 
Some have interpreted them 
as applying only to the patent- 
ing of genes, whereas others 
suggested that they change 
the scope of natural products 
that can be patented. 

Because of the great im- 
portance of these somewhat 
esoteric issues for facilitating the generation of new diag- 
nostics and drugs and for advancing basic research, it is 
crucial that appropriate steps be taken to guide modifica- 
tion of existing patent law. At this point, case law based 
on particular court rulings is controlling the evolution 
of these patent rulings. However, a more intentional ap- 
proach implemented through legislation has the potential 
to address these issues in a more comprehensive manner. 
This task should be informed by a thorough process that 
the National Academies can effectuate—bringing together 
stakeholders including academic scientists, industry lead- 
ers, patent experts, and patient advocates. Striking the 
right balance with language that clearly conveys the in- 
tended interpretations should strengthen patent protec- 
tion, bolster science, and improve societal well-being. 


—Jeremy Berg 
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Editor-in-Chief, 
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44 Ending the manel begins at the top. 99 


Francis Collins, director of the U.S. National Institutes of Health, saying he will not 
appear on all-male public panels—“manels”—common at scientific meetings. 


Awoman in Mpondwe, Uganda, washes with chlorinated water as a precaution against Ebola. 


INFECTIOUS DISEASE 
As Ebola spreads, no emergency call 


nacontroversial decision, the World Health Organization (WHO) in 

Geneva, Switzerland, decided last week not to declare Africa’s con- 

tinuing Ebola outbreak, which has killed more than 1400 people, a 

Public Health Emergency of International Concern (PHEIC), even 

as the disease moved from the Democratic Republic of the Congo 

(DRC) into a neighboring country. Two people died in Uganda last 
week after crossing the border from the DRC, which has battled the 
outbreak since August 2018. Some public health experts argued that 
a declaration would focus global attention on what is now the largest 
outbreak of Ebola since the one that ravaged West Africa 5 years ago; 
a PHEIC would allow WHO to share information about the disease’s 
spread without the affected countries’ consent and make temporary 
recommendations that member states must follow. But the outbreak 
has not satisfied all PHEIC criteria, said Preben Aavitsland, acting chair 
of a WHO expert committee. He said a declaration could be counter- 
productive because countries are already sharing information and fol- 
lowing WHO’s advice, and a PHEIC could lead to restrictions on travel 
and trade that might hamper public health efforts in the region. Even 
so, he and the critics agree that the international community must pro- 
vide more funding and resources to control the outbreak. 
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Trump targets advisory panels 


SCIENCE POLICY | U.S. President Donald 
Trump thinks the U.S. government has too 
many expert committees giving strategic 
advice to federal research agencies. That’s 
the gist of a 14 June executive order that 
gives each of those agencies, including 
NASA, the National Science Foundation, 
and the Department of Energy’s Office of 
Science, until 30 September to wipe out 
at least one-third of its top-level advisory 
panels. The order exempts panels at the 
National Institutes of Health and other 
agencies that review grant proposals or 
help ensure the safety and efficacy of 
drugs, food, and other consumer products. 
It also excludes—for now—panels created 
by law rather than agency edict, although 
the order invites agencies to propose a 
hit list of those statutory committees in 
their upcoming budget requests. A list 

of hundreds of these committees is avail- 
able at http://bit.ly/2INdueZ. 


NIH workers report harassment 


#METOO | More than one in five employ- 
ees at the U.S. National Institutes of 
Health in Bethesda, Maryland, said they 
were sexually harassed at work recently, 
according to interim results from a survey 
released last week, to which nearly 16,000 
NIH employees and agency contractors 
responded. Eighteen percent of the respon- 
dents to the survey, conducted between 
January and March, said that in the prior 
12 months, they had experienced gender 
harassment such as sexist put-downs, 
10.3% reported receiving unwanted sexual 
attention, and 0.3% said they were subject 
to sexual coercion. In all, 21.6% said they 
had coped with one or more of these three 
forms of sexual harassment. 


China regulates genetic material 


GENETICS | China last week announced 
national regulations for study and 
transport of human biological material 
containing DNA. The government intends 
the policies to encourage R&D of drugs 
and medical treatments. But the rules 
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CLIMATE écianee 
Abnormal heat causes - 
early Greenland melt 


he melting season in Greenland abruptly accelerated last 
week, as Steffen Olsen, a climate scientist at the Danish 
Meteorological Institute in Copenhagen, and a colleague (above) 
discovered when they set off on dog sleds to retrieve research 
equipment from a fjord on Greenland’s northwest coast. 

An unusual spell of heat, with temperatures briefly more than 20°C 


are also meant to protect patient privacy 
and control of the material: One of the 
rules requires foreign scientists to have 
Chinese collaborators to access and use the 
resources and data. That reflects concern 
by the government that foreign research- 
ers have collected and removed biological 
materials from China for study without 
permission. The rules cover the collection, 
storage, and use of organs, tissues, and 
cells as well as the underlying genetic 
data. They also specify fines of up to 

5 million Chinese yuan (about $724,000) 
for violations. 


New York ends vaccine exemption 


PUBLIC HEALTH | New York state last week 
eliminated an exemption that had allowed 
parents to refuse to vaccinate their school- 
attending children because of “genuine and 
sincere religious beliefs.” State lawmakers 
in Albany acted despite vocal protests by 
scores of vaccination opponents at the 
Capitol. Governor Andrew Cuomo immedi- 
ately signed the bill ending the exemption, 
calling it a response to a public health crisis. 
The state has been the epicenter of a U.S. 
measles epidemic this year, logging more 
than 850 of the 1044 cases nationwide as of 
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13 June. Most of New York state’s cases have 
occurred in ultra-Orthodox Jewish com- 
munities. New York joins five other states 
that bar religious exemptions: Arizona, 
California, Maine, Mississippi, and West 
Virginia. New York requires vaccination of 
children who attend any public or private 
daycare, preschool, or school. 


Panel: Keep Landsat images free 


EARTH SCIENCE | A federal advisory panel 
last week called on the U.S. Department 
of the Interior (DOT) to maintain free 
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This false-color photo taken by Landsat-8 captured a 


developing “snow swamp” in Canada’s Yukon territory. 
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above normal, left meltwater on top of ice and set more than 40% 
of Greenland's entire ice sheet melting at its surface, an extent seen 
only at the peak of summer last year. Human-driven climate change 
has warmed the Arctic more than any other region, and researchers 
fear this summer could rival records set in 2012, when a July heat 
wave drove surface melt across nearly all of Greenland’s ice sheet. 


and public access to widely used imagery 
from its Landsat satellite program. DOI 
could decide nevertheless to charge fees 
for access, but is not expected to. Begun 
in 1972, Landsat has produced the world’s 
longest-running series of satellite images, 
used by academics and industry to docu- 
ment global change. In 2017, DOI asked the 
National Geospatial Advisory Committee 
to evaluate the possibility of charging for 
Landsat data, which the government did 
until 2008. Such a move would be unwise, 
the panel concluded, driving users to free 
alternatives and generating little money. 


U.K. integrity watchdog gears up 


RESEARCH ETHICS | The United Kingdom’s 
main funding agency, UK Research and 
Innovation (UKRD), is creating a watch- 
dog to oversee research integrity. Last 

year, a Parliament committee called for 
national oversight, citing concerns that 
universities were not fully complying with 
requirements to report on their internal 
investigations of research misconduct. 
UKRI is establishing an independent entity 
to make sure institutions follow “appropri- 
ate processes to investigate misconduct,” 
according to the agency’s annual strategic 
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SPACE STATION China announced nine 
scientific experiments it will run on its 
new space station, scheduled for comple- 
tion in 2022. Projects were selected in 

an international competition and involve 
scientists from 17 countries, but not the 
United States. Topics include effects of 
space flight on tumor formation in human 
organoid tissue. 


FETAL TISSUE The Democratic-controlled 
U.S. House of Representatives amended 
a proposed 2020 spending bill to block 

a key part of President Donald Trump's 
administration's new policy that restricts 
U.S.-funded research with fetal tissue 
donated after elective abortions. 

The amendment prohibits funding ethics 
advisory boards to review new fetal 
tissue projects. 


NEW WEED KILLERS Bayer AG in 
Leverkusen, Germany, announced it will 
spend €5 billion over the next decade on 
research to develop weec-killing alterna- 
tives to its controversial, widely used 
herbicide glyphosate. Countries in Europe 
have been moving toward banning the 
product, and Bayer faces multimillion- 
dollar U.S. jury verdicts from lawsuits 
alleging that it caused illnesses. 


AGENCIES MOVED The U.S. Department 
of Agriculture has picked the Kansas City, 
Missouri, region to host two research 
units that many scientists say should be 
kept in Washington, D.C. “[Agriculture] 
Secretary [Sonny] Perdue is well on 

his way to dismantling one of the best 
agricultural economics research institu- 
tions in the world,” says Ron Wasserstein 
of the American Statistical Association 

in Alexandria, Virginia, about one of the 
units, the Economic Research Service. 
The other is the National Institute of Food 
and Agriculture. 


DOCUMENTARY QUESTIONED Cell bio- 
logists voiced qualms about being shown in 
a documentary touting stem cell therapies 
after discovering the project was funded 

by for-profit companies under investiga- 
tion for offering the treatments, the San 
Francisco Chronicle reported. Makers of 
The Healthcare Revolution removed Jeanne 
Loring, a professor emeritus at Scripps 
Research in San Diego, California, from the 
film at her request. No stem cell therapy 
sold at for-profit clinics has been approved 
by the U.S. Food and Drug Administration. 


S SCIENCEMAG.ORG/NEWS 
Read more news from Science online. 


1116 21 JUNE 2019 + VOL 364 ISSUE 6446 


plan released last week. UKRI provided 

no additional details but says it will issue 
an update this summer. The strategic plan 
also says UKRI will create an ethics state- 
ment, evaluate training of Ph.D. students 
in research integrity, and fund research on 
how academic incentives could be adjusted 
to support research integrity. 


Fecal transplant risks arise 


CLINICAL RESEARCH | After a patient 
died from an infection acquired during a 
fecal transplant, the U.S. Food and Drug 
Administration (FDA) last week issued an 
alert to clinicians about the procedure’s 
risks and suspended an undisclosed num- 
ber of clinical trials. Inserting a healthy 
person’s stool into a patient’s colon has 
shown dramatic results in clearing intes- 
tinal infections caused by the bacterium 
Clostridium difficile; the method is also 
under investigation to treat other illnesses. 
But in two cases described in the agency’s 
13 June warning, a donor’s stool was not 
screened for other dangerous bacteria and 
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contained drug-resistant Escherichia colt. 
Two immunocompromised recipients 
developed infections, and one died. FDA 
is now requiring clinical researchers to 
establish procedures to screen donors and 
their stools before resuming the trials. 


Dutch college seeks only women 


DIVERSITY | The Eindhoven University of 
Technology (TUE) in the Netherlands said 
this week it is taking radical action to hire 
more female academics by opening job 
vacancies to women only. The engineering 
institution says it will not allow men to 
apply for permanent academic jobs for the 
first 6 months of the recruitment process 
under a new fellowship program. Dutch 
and EU laws allow policies to recruit 
underrepresented groups, TUE says. But 
only 29% of TUE’s assistant professors are 
women; about 15% are at the associate and 
full professor level. The university wants 
women to make up 50% of its assistant 
and associate professor positions and 35% 
of its full professors. 


Global survey finds strong support for scientists 


early three-quarters of people worldwide solidly trust scientists: That’s one of the main 
findings of the Wellcome Global Monitor, a new survey that asked 140,000-plus people 


in more than 140 countries how they think and feel about health and science. Other polls 


have asked similar questions, but this one, conducted by Gallup World Poll on behalf of 


London-based biomedical charity the Wellcome Trust, claims to be the first to study ona 


global scale how attitudes vary by nationality, gender, income, and education. An index based 
on five questions found that 54% of people trust scientists at a medium level and 18% ata 
high level, whereas 14% have a low level of trust. But regional differences are striking: People 
in Uzbekistan say they trust scientists the most, residents of Gabon the least. On the 
benefits of science, more than one-third of people in southern Africa and Latin America say 
science helps “very few” people in their country. The survey also explored attitudes toward 


vaccines and found people in France are the most skeptical (see p. 1122). The survey reveals a 
global gender gap in self-assessments of scientific knowledge: Globally, 49% of men say they 


know “some” or “a lot” about science—a full 11 percentage points more than women. 


Level of trust in scientists, by selected regions 


World 18% 


Western Europe 24% 


Northern Europe 33% 


Eastern Europe 15% 


United States 


and Canada 26% 


Central Asia 


South America 13% 


Central Africa 12% 


Percentages are rounded and may not add up to 100%. 


Published by AAAS 


@ High @ Medium @ Low 


s 


13% 


CREDITS: (GRAPHIC) A. CUADRA/SCIENCE; (DATA) HTTPS://WELLCOME.AC.UK/ WHAT-WE-DO/OUR-WORK/WELLCOME-GLOBAL-MONITOR 


© 
s 


© 
s 


20% 


sciencemag.org SCIENCE 


Don't know/don't answer 


a 
s 


PHOTO: STEPHEN VOSS 


Francis Collins, director of the National Institutes of Health, says he’s pleased with bold recommendations to help the agency curb sexual harassment. 


SCIENCE POLICY 


NIH prepares to toughen harassment rules 


Advisory group urges mandatory reporting of #MeTooSTEM investigations to agency 


By Jocelyn Kaiser 


ast year, on the heels of several high- 

profile cases of sexual harassment 

in science, the National Institutes 

of Health (NIH) came under fire for 

standing by while another major U.S. 

research funding agency, the National 
Science Foundation (NSF), quickly tight- 
ened its policies. But now, NIH officials ap- 
pear ready to act—and even to go beyond 
NSF’s rules. NIH leaders said last week 
that they plan to move forward with four 
recommendations from a group advising 
the agency, including hard-hitting report- 
ing requirements aimed at making sure 
NIH doesn’t unknowingly fund researchers 
found guilty of sexual harassment. 

NIH Director Francis Collins said he wel- 
comed the advice from the 14 outside scien- 
tists and seven NIH staff members who had 
been meeting since February: “I’m happy 
the recommendations are quite bold; he 
said after the group presented to his Advi- 
sory Committee to the Director at NIH in 
Bethesda, Maryland, on 13 June. Collins later 
announced NIH would “be pursuing mecha- 
nisms and approaches to implement these 
interim recommendations, where feasible.” 

The group’s report is drawing praise from 
the #MeTooSTEM movement and others. 
“This is a very dramatic shift. ’m jump- 


SCIENCE sciencemag.org 


ing up and down about the progress,” says 
Kathy Hudson, formerly the senior policy 
official at NIH and now a consultant in 
Washington, D.C. But she and others hope 
the group’s final recommendations, due in 
December, will go even further. 

NSF last fall began to require that insti- 
tutions report when a principal investigator 
(PI) has been found guilty of sexual harass- 
ment or when institutions take actions such 
as putting a scientist on leave as they inves- 
tigate allegations of such misconduct. The 
goal was to avoid repeating past cases in 
which the agency learned belatedly, through 
media reports, that scientists it was funding 
had been found to have harassed women. 
But NIH held off on new policies and instead 
appointed the working group to explore pos- 
sible changes. 

Topping the list of its proposed changes 
is that sexual harassment be treated “as 
seriously as research misconduct,’ a recom- 
mendation that comes from a 2018 National 
Academies of Sciences, Engineering, and 
Medicine (NASEM) report (Science, 15 June 
2018, p. 1159). That would mean establish- 
ing rules parallel to those now requiring 
institutions to inform the Department of 
Health and Human Services about research 
misconduct cases. Specifically, the group 
says, NIH should require that institutions 
tell the agency about any finding or inves- 
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tigation of alleged misconduct—including 
sexual harassment—within 1 week after an 
investigation begins or findings are issued. 

The advisers also recommended that PIs 
and co-PIs themselves “attest” on grant ap- 
plications and progress reports that they 
have not violated and will not violate their 
institution’s code of conduct. This second 
reporting step goes beyond NSF’s guide- 
lines. The group proposed specific ques- 
tions asking whether the applicant has been 
found guilty of or been involved in a legal 
settlement concerning sexual harassment 
or other misconduct within the past 7 years. 
The idea is to encourage good behavior as 
well as to potentially prevent some harass- 
ers from continuing to get NIH funding. 
Francis Cuss, a former Bristol-Myers Squibb 
executive in New York City who co-chaired 
the working group, said at the meeting that 
responses would go to NIH staff, who would 
consider whether to deny or revoke fund- 
ing or impose other penalties on a case-to- 
case basis. 

“Right now NIH has no way to know” 
about harassment allegations, says biologist 
Carol Greider of Johns Hopkins University 
in Baltimore, Maryland, part of the advisory 
group. “To know it’s happening ... is a large 
step in the right direction.” 

NIH should also take steps to restore the 
careers of sexual harassment survivors, the 
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group says, for example by encouraging them 
to apply to programs that help research- 
ers restart their careers after dropping out 
for reasons such as having a baby. Finally, 
NIH should give trainees more control over 
their financial support, perhaps by giving 
more awards directly to individual trainees, 
rather than to institutions or mentors. 

The time limit on reporting past mis- 
deeds was set so that a reformed harasser 
would not be “branded for life,” explained 
working group co-chair and NIH Associate 
Director for Science Policy Carrie Wolinetz 
at the meeting. The group will examine the 
7-year limit before it issues a final report in 
December, added co-chair Kristina Johnson, 
chancellor of the State University of New 
York system. It will also explore whether the 
reporting policies should apply to staff who 
are not PI or co-PI on a grant. 

The recommendations “while important, 
will only catch a tiny minority of harassers,” 
predicts Lilia Cortina, a psychologist at the 
University of Michigan in Ann Arbor who 
helped write the NASEM report. She hopes 
NIH will also have institutions survey labs 
it funds to assess whether sexual harass- 
ment is happening, as it has recently done 
for its intramural researchers (see p. 1114). 

NIH officials said previously that adopt- 
ing a reporting policy like NSF’s could re- 
quire a lengthy, formal rulemaking process. 
Greider thinks that bureaucratic obstacle 
can be surmounted. “I do hope the working 
group goes further,” she says. “We will have 
until December to work on a lot of this.” 

One prominent #MeTooSTEM activist was 
pleased with the recommendations. “There 
are a lot of good things,” including that NIH 
will be more directly involved in sanction- 
ing harassers, says BethAnn McLaughlin of 
Vanderbilt University in Nashville, who has 
sharply criticized NIH for its lack of action. 

McLaughlin also finds new data NIH 
shared at the meeting last week to be “hugely 
inspiring.” The numbers show that disci- 
plinary actions involving sexual harass- 
ment are up, suggesting NIH is taking the 
problem seriously. So far this year, NIH says 
it has reviewed 31 cases of possible harass- 
ment (more than in all of 2018), removed 
five scientists from grants and stopped us- 
ing 19 peer reviewers. 

Congress is also moving to reconcile sexual 
harassment policies across federal science 
agencies. This week, the science committee 
in the U.S. House of Representatives was 
expected to approve a measure intended to 
push the largest federal research agencies, 
including NIH, toward a “uniform set of 
policies” on sexual harassment. 


With reporting by Meredith Wadman and 
Jeffrey Mervis. 


1118 = 21 JUNE 2019 + VOL 364 ISSUE 6446 


or 


CHINA 


Hong Kong scientists protest, 
but also forge mainland ties 


Cross-border research collaborations expand, even as conflict 
continues over city’s semiautonomous status 


By Dennis Normile 


fter a series of massive protests by 

Hong Kong’s residents, including 

many academics, the leaders of the 

semiautonomous Chinese city last 

week shelved controversial legislation 

that would have allowed people there 
to be extradited to mainland China. But even 
as that battle to preserve independence con- 
tinues, Hong Kong’s researchers are forging 
closer ties with the mainland. 

Those links will be strengthened this year, 
with several new cross-border funding pro- 
grams set to make their first awards. And 
although many researchers welcome the new 
opportunities for funding and collaboration, 
some worry they could give Beijing greater 
influence over Hong Kong’s research agenda. 

The tension arises from Hong Kong’s spe- 
cial political status. In 1997, China regained 
control of the former U.K. colony under a 
“one country, two systems” policy that gives 
Hong Kong’s 7.4 million residents a greater 
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say in their economic and political affairs. 
Academic efforts have thrived under the ar- 
rangement. The city now hosts nearly 30,000 
researchers, creating a per capita ratio triple 
that found on China’s mainland, according 
to United Nations statistics. Hong Kong’s 
research spending has risen from just 0.4% 
of its gross domestic product in 1998 to 0.8% 
in 2017. Several of the city’s universities are 
among the top 50 in the world, according to 
this year’s Times Higher Education rankings. 

Research ties with mainland China have 
grown since the handover. In 1998, 16.5% of 
all scientific papers produced in Hong Kong 
involved collaborations with the mainland; 
by 2017, the share had jumped to 53.2%, in- 
formation scientists Ma Qian and Li Wenlan 
of Tianjin University in China reported in 
September 2018 in Scientometrics. 

Funding ties are also deepening. Hong 
Kong researchers have long been able to win 
grants from the Chinese government, but the 
money had to be spent within the mainland. 
Last year, however, the government dropped 
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Two million people, a quarter of Hong Kong, China’s 
residents, joined protests against an extradition bill. 


that requirement at the request of prominent 
Hong Kong scientists. In the first grants un- 
der the new policy, China’s Ministry of Sci- 
ence and Technology (MOST) in May 2018 
awarded 22 million Chinese yuan ($3.2 mil- 
lion) to 22 Hong Kong research groups. 

Two new cross-border programs will an- 
nounce their first grants later this year. One 
is backed by MOST and Hong Kong’s Innova- 
tion and Technology Bureau, and the other 
by Hong Kong’s Research Grants Council and 
China’s National Natural Science Foundation. 
The latter will focus on six research areas, in- 
cluding medicine and materials science. 

Even as Hong Kong has strengthened 
scientific ties with the mainland, questions 
about the durability of the city’s special sta- 
tus have grown. The current protests began 
soon after the extradition bill was unveiled 
in April. Opponents said it would enable 
mainland authorities to seize political op- 
ponents on flimsy charges. And more than 
1700 academics from around the world 
voiced support for their Hong Kong col- 
leagues by signing an online petition warn- 
ing that the bill was “jeopardizing the rule 
of law and human rights in Hong Kong.” On 
15 June, Hong Kong officials “indefinitely” 
postponed action on the bill. 

The bill was “definitely a concern for 
academics,” because it could have had “a 
chilling effect on people working on so- 
called ‘sensitive’ topics,’ says philosopher 
Timothy O’Leary of the University of New 
South Wales in Sydney, Australia, who taught 
at the University of Hong Kong (HKU) for 
17 years. Some scientists also worried the 
law would hamper recruitment efforts. 

Authorities on both sides argue cross- 
border collaborations advance science and 
help Hong Kong become an innovation 
hub. But such schemes are also “a very use- 
ful mechanism” for integrating Hong Kong 
into China, notes one senior Hong Kong 
scientist, who asked to remain anonymous 
because of the issue’s sensitivity. Even so, 
he says it would be a leap from tighter inte- 
gration “to Hong Kong institutions losing 
their autonomy.” 

Other researchers are even more sanguine. 
HKU microbiologist Yuen Kwok-Yung doubts 
“such additional funding will erode academic 
freedom in [Hong Kong] as long as ... the 
independent judiciary and free press are 
still being protected.” And O’Leary says the 
recent protests show that Hongkongers “will 
not easily acquiesce to an encroachment on 
their civil liberties.” But he urges the city’s 
universities to follow the protesters’ lead 
“and continue to insist on the nonnegotiable 
importance of academic freedom.” § 
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Medicine contends with how 
to use artificial intelligence 


Barriers include lack of reproducibility across hospitals 


and populations 


By Jennifer Couzin-Frankel 


rtificial intelligence (AI) is poised 
to upend the practice of medicine, 
boosting the efficiency and accuracy 
of diagnosis in specialties that rely 
on images, such as radiology and 
pathology. But as the technology gal- 
lops ahead, researchers and physicians are 
grappling with its potential downsides. “Just 
working with the technology, I see lots of 
ways it can fail” says Albert Hsiao, a radio- 
logist at the University of California, San 
Diego, who has developed 
algorithms for reading 
cardiac images and im- 
proving their quality. One 
major concern: Most AI 
software is designed and 
tested in one _ hospital, 
and it risks faltering when 
transferred to another. 

U.S. government §sci- 
entists, regulators, and 
doctors last month pub- 
lished a road map de- 
scribing how to convert 
research-based AI into 
improved medical imag- 
ing for patients. Among 
other things, the com- 
mentary in the Journal 
of the American College 
of Radiology urged more 
collaboration across dis- 
ciplines in building and 
testing AI algorithms, 
and intensive validation 
of algorithms before they 
reach patients. For now, 
Hsiao says, “I would want a human physi- 
cian no matter what,’ even if a machine 
hums alongside. 

Most AI in medicine is used in research, 
but regulators have already approved some 
algorithms for radiologists. Physicians are 
also developing their own—which they’re 
permitted to use without regulatory ap- 
proval as long as companies aren’t market- 
ing the new technology. Studies are testing 
algorithms to read x-rays, detect brain 
bleeds, pinpoint tumors, and more. 
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Inachest x-ray (top), artificial intelligence 
marks a likely infection (bottom). 


The algorithms learn as scientists feed 
them hundreds or thousands of images— 
of mammograms, for example—training 
the technology to recognize patterns faster 
and more accurately than a human could. 
“If ’m doing an MRI of a moving heart, I 
can have the computer predict where the 
heart’s going to be in the next fraction of 
a second and get a better picture instead 
of a blurry” one, says Krishna Kandarpa, 
a cardiovascular and interventional radio- 
logist at the National Institute of Bio- 
medical Imaging and Bioengineering in 
Bethesda, Maryland. 

Or AI might analyze 
computerized tomogra- 
phy head scans of sus- 
pected strokes, label those 
more likely to harbor 
a brain bleed, and put 
them on top of the pile 
for the radiologist to 
examine. An algorithm 
could help spot breast 
tumors in mammograms 
that a radiologist’s eyes 
risk missing. 

But Eric Oermann, a 
neurosurgeon at Mount 
Sinai Hospital in New 
York City, has explored 
one downside of the al- 
gorithms: The signals 
they recognize can have 
less to do with disease 
than with other pa- 
tient characteristics, the 
brand of MRI machine, 
or even how a scanner is 
angled. With colleagues, 
Oermann developed a mathematical model 
for detecting patterns on chest x-rays con- 
sistent with pneumonia and trained it with 
images from patients at Mount Sinai. The 
hospital has a busy intensive care unit with 
many elderly people, who are often admit- 
ted with pneumonia; 34% of the Mount 
Sinai x-rays came from infected patients. 

When the algorithm was tested on a 
different batch of Mount Sinai x-rays it 
performed admirably, accurately detect- 
ing pneumonia 93% of the time. But 


21 JUNE 2019 + VOL 364 ISSUE 6446 1119 


NEWS | IN DEPTH 


Oermann also tested it on tens of thou- 
sands of patient images from two other 
sites: the National Institutes of Health 
Clinical Center in Bethesda and the Indi- 
ana Network for Patient Care. With x-rays 
from those locations—where pneumonia 
rates just squeaked past 1%—the success 
rate fell, ranging from 73% to 80%, the 
team reported last year in PLOS Medicine. 
“Tt didn’t work as well because the patients 
at the other hospitals were different,” 
Oermann says. 

At Mount Sinai, many of the infected 
patients were too sick to get out of bed, 
and so doctors used a portable chest x- 
ray machine. Portable x-ray images look 
very different from those created when a 
patient is standing up. Because of what it 


learned from Mount Sinai’s x-rays, the al- 
gorithm began to associate 
a portable x-ray with illness. 
It also anticipated a high 


“Just working 


patients at two Dutch centers. Then, he 
invited all comers to train their algorithm 
on 270 of those slides and test it on the 
remaining 130, to see whether it could do 
better than pathologists hunting for tiny 


cancers. Twenty-three teams submitted 
32 algorithms. 
The results, published in 2017 in 


JAMA, showed that 10 matched or ex- 
ceeded a panel of 11 pathologists. The top- 
performing algorithm, from a group at 
Harvard University and the Massachu- 
setts Institute of Technology in Cambridge, 
matched a pathologist who took an entire 
weekend to go over 130 slides. 

“To actually see that you could be as good 
as a pathologist [was] a shock,’ van der Laak 
says. He and others say that to achieve that 
kind of accuracy, AI algorithms should train 
on data that are diverse not 
only in their hospital of origin, 
but also in race and geography, 


rate of pneumonia, boost- with the because disease can manifest 
ing misdiagnoses. differently across populations. 

Few multisite studies like technology, The U.S. Food and Drug 
Oermann’s have been pub- T see lots of Administration (FDA) con- 
lished, and last month’s road e ez 99  tinues to weigh how to as- 
map deemed this worrying. way: Sit can fail. sess algorithms for patient 
This year, a South Korean Albert Hsiao, care. The agency consid- 
team reported in the Ko- University of California, ers “locked” AI _ software, 
rean Journal of Radiology San Diego which is unchanging, as 


an analysis of 516 studies of 

AI algorithms designed to interpret medi- 
cal images. The authors found that just 
6% of the studies tested their algorithm at 
more than one hospital. “It’s very concern- 
ing,” says Elaine Nsoesie, a computational 
epidemiologist at Boston University. Even 
the brand of scanner matters, as the pixel 
pattern can vary, disrupting how AI as- 
sesses the image. 

One way to avoid this pitfall, Nsoesie 
says, is to test an algorithm using data 
from several hospitals. Researchers are 
starting to do this, she says, “but less than 
you would think.” A rare example is an al- 
gorithm first trained and tested on data at 
Stanford University’s Lucile Packard Chil- 
dren’s Hospital in Palo Alto, California, and 
Children’s Hospital Colorado in Aurora. 
It’s now undergoing testing in a clinical 
trial on scans from nine sites. The soft- 
ware measures skeletal maturity in hand 
x-rays, which orthopedists use to guide 
treatment for growth disorders in children 
and teenagers. 

In pathology, another field that relies on 
images, Jeroen van der Laak, a computer 
scientist at Radboud University Medical 
Center in the Netherlands, tried a new way 
to encourage researchers to test their algo- 
rithms across hospitals: a competition. In 
2015, van der Laak gathered and digitized 


400 lymph node slides from breast cancer 
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a medical device. But ear- 
lier this year, it announced it was devel- 
oping a framework for regulating more 
cutting-edge AI software that continues 
to learn over time. Still, there “are major 
questions everyone is struggling with” 
around regulation, says Hugo Aerts, who 
directs the computational imaging and 
bioinformatics laboratory at the Dana- 
Farber Cancer Institute in Boston. What if 
developers update an algorithm that works 
in 96% of cases to achieve a 99% success 
rate; do they have to go through the regu- 
latory process again? What if an approved 
algorithm is applied to a patient popula- 
tion it wasn’t originally tested on? 

FDA has already issued some approvals. 
One algorithm, created by Hsiao, measures 
heart size and blood flow in a cardiac MRI. 
Hsiao was frustrated that analyzing the 
data by hand took several hours, so he re- 
turned to his roots as a computer science 
major and wrote his own software. He 
subsequently formed a company, Arterys, 
based in San Francisco, California, and 
won FDA approval in about 6 months, he 
says. Hsiao is now working on algorithms 
that help identify pneumonia infections in 
the lungs. 

But, he says, the doctor, not the machine, 
is still the boss, and entitled to override the 
technology. “If I think it’s not pneumonia,” 
Hsiao says, “it’s not.” 
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OCEAN SCIENCE 


Fight for the 
Arctic Ocean 
is a boon 
for science 


Armed with seafloor maps, 
Russia, Denmark, and 
Canada seek control over 
the North Pole 


By Richard Kemeny 


competition for the North Pole heated 
up last month, as Canada became the 
third country to claim—based on ex- 
tensive scientific data—that it should 
have sovereignty over a large swath of 
the Arctic Ocean, including the pole. 

Canada’s bid, submitted to the United 
Nations’s Commission on the Limits of the 
Continental Shelf (CLCS) on 23 May, joins 
competing claims from Russia and Denmark. 
Like theirs, it is motivated by the prospect 
of mineral riches: the large oil reserves be- 
lieved to lie under the Arctic Ocean, which 
will become more accessible as the polar ice 
retreats. And all three claims, along with 
dozens of similar claims in other oceans, rest 
on extensive seafloor mapping, which has 
proved to be a boon to science, whatever the 
outcome for individual countries. The race to 
obtain control over parts of the sea floor has 
“dramatically changed our understanding of 
the oceans,” says marine geophysicist Larry 
Mayer of the University of New Hampshire 
in Durham. 

Coastal nations have sovereign rights over 
an exclusive economic zone (EEZ), extending 
by definition 200 nautical miles (370 kilo- 
meters) out from their coastline. But the 1982 
United Nations Convention on the Law of 
the Sea opened up the possibility of expand- 
ing that zone if a country can convince CLCS 
that its continental shelf extends beyond the 
EEZ’s limits. 

Most of the 84 submissions so far were 
driven by the prospect of oil and gas, al- 
though advances in deep-sea mining technol- 
ogy have added new reasons to apply. Brazil, 
for example, filed an application in Decem- 
ber 2018 that included the Rio Grande Rise, a 
deep-ocean mountain range 1500 kilometers 
southeast of Rio De Janeiro that’s covered in 
cobalt-rich ferromanganese crusts. 
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To make a claim, a country has to submit 
detailed data on the shape of the sea floor and 
on its sediment, which is thicker on the shelf 
than in the deep ocean. The data come from 
sonar, which reveals seafloor topography, and 
seismic profiling, which uses low-frequency 
booms to probe the sediment. Canada’s bid 
also enlisted ships to conduct high-resolution 
gravimetry—measurements of gravity anom- 
alies that reveal seafloor structure. Elevated 
gravity readings are found over higher- 
density mantle rocks found in oceanic crust, 
and lower readings over lighter, continen- 
tal structures. And the bid used analyses of 
800 kilograms of rock samples dredged up 
from the sea floor, whose composition can 
distinguish continental from ocean crust. 

The studies don’t come cheap; Canada’s 
17 Arctic expeditions alone cost more than 
CA$117 million. But the work by the three 
countries vying for the Arctic—and that of 
dozens of others elsewhere in the world—has 
been a bonanza for oceanography. In the Arc- 
tic alone, the mapping has revealed several 
sunken mountains, previously missed or un- 
detected by older sonar methods. Hundreds 
of pockmarks found on the Chukchi Cap, 
a submarine plateau extending out from 
Alaska, suggest that bursts of previously fro- 
zen methane have erupted from the seabed, 
a phenomenon that could accelerate climate 
change. And gaps discovered across subma- 
rine ridges allow currents to flow from basin 
to basin, with “important ramifications on 
the distribution of heat in the Arctic and on 
overall modeling of climate and ice melting,” 
Mayer says. 

CLCS, composed of 21 scientists in fields 
such as geology and hydrography who are 
elected by member states, has accepted 
24 of the 28 claims it has finished evalu- 
ating, some partially or with caveats; in 
several cases, it has asked for follow-up sub- 


Who owns the North Pole? 


Countries can claim the sea floor beyond the 200-nautical-mile 
(370-kilometer) exclusive economic zone (EEZ) if data show it 
to be an extension of the continental shelf (below). Russia, 
Denmark, and Canada have submitted overlapping claims in 


the Arctic Ocean (right). 


Coastline 


EEZ Outer limit of 
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continental shelf 
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Canadian and U.S. Coast Guard ships worked together to map the Arctic sea floor for continental shelf claims. 


missions with more data. Australia was the 
first country to succeed, adding 2.5 million 
square kilometers to its territory in 2008. 
New Zealand gained undersea territory six 
times larger than its terrestrial area. But 
CLCS only judges the merit of each indi- 
vidual scientific claim; it has no authority 
to decide boundaries when claims overlap. 
To do that, countries have to turn to diplo- 
matic channels once the science is settled. 
The three claims on the North Pole revolve 
around the Lomonosov Ridge, an under- 
water mountain system that runs from 
Ellesmere Island in Canada’s Qikiqtaaluk 
region to the New Siberian Islands of Rus- 
sia, passing the North Pole. Both countries 
claim the ridge is geologically connected to 
their continent, whereas Denmark says it is 
also tied to Greenland, a Danish territory. 
As the ridge is thought to be continental 
crust, the territorial extensions could be ex- 
tensive. (U.S. scientists should finish map- 
ping in the Arctic in about 2 years, says 
Mayer, who is involved in that effort, but as 
one of the few countries that hasn’t ratified 
the Law of the Sea convention, the United 
States can’t file an official submission.) 
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Tensions flared when Russia planted a tita- 
nium flag on the sea floor beneath the North 
Pole in 2007, after CLCS rejected its first 
claim, saying more data were needed. The Ca- 
nadian foreign minister at the time likened 
the move to the land grabs of early European 
colonizers. Not that the North Pole has any 
material value: “The oil potential there is zip,” 
says geologist Henry Dick of the Woods Hole 
Oceanographic Institution in Massachusetts. 
“The real fight is over the Amerasian Ba- 
sin,” Dick says (see map, below) where large 
amounts of oil are thought to be locked up. 

It will take years, perhaps decades, for 
CLCS to rule on the overlapping Arctic 
claims. Whoever wins the scientific contest 
still faces a diplomatic struggle. 

Denmark, Russia, and Canada have ex- 
pressed their desire to settle the situation 
peacefully. “Russia actually has played nice 
on this and stopped at the North Pole,’ 
rather than extending its claim along the 
length of the ridge, says Philip Steinberg, a 
political geographer at Durham University in 
the United Kingdom. Denmark had no such 
qualms. and put in a claim up to the edge 
of Russia’s EEZ, “even though there’s no way 
in hell they’ll get that,’ when 
it comes to the diplomatic dis- 
cussions, Steinberg says. 

One solution would be to use 
the equidistance principle, by 
drawing a median line between 
the coastlines, as has been 
done when proposed marine 
territories overlapped in the 
past; doing so would mean the 
North Pole falls to Denmark. 
There’s also a proposal to make 
the pole international, like Ant- 
arctica, as a sign of peace, says 
Oran Young, a political scien- 
tist at the University of Califor- 
nia, Santa Barbara. “It seems a 
very sensible idea.” & 
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Richard Kemeny is a science 
journalist in Sado Paulo, Brazil. 
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France is wary of science and 
vaccines, global survey finds 


Researchers attribute suspicion to institutional scandals 


By Tania Rabesandratana 


uelle surprise. France, cradle of the 

Concorde, the face transplant, and the 

first isolation of HIV, is more wary of 

vaccines and the economic value of sci- 

ence than more than 140 other coun- 

tries, according to a global survey of 
public attitudes toward science and health 
released this week. But French scientists 
say the skepticism is familiar and doesn’t 
affect their work; some suggest it instead 
reflects a deep-seated mistrust of institu- 
tions. “We think there’s a problem of trust 
in government, in particular in health au- 
thorities,’ says Pierre Verger, an epidemio- 
logist who studies vaccine 


diator, an amphetamine-based diabetes drug 
linked to hundreds of deaths, amid concerns 
of undue industry influence. The French have 
also been unhappy with heavy-handed public 
health campaigns, such as a costly mass vac- 
cination for swine flu that same year. “France 
is one of a few countries where [such contro- 

versies] have been so frequent,’ Verger says. 
The survey also reveals French pessi- 
mism about the economic value of science. 
Some 55% say they see science and tech- 
nology as a threat to local jobs in the next 
5 years. Although France is the only coun- 
try scoring above 50% on this question, a 
similar gloominess spans other parts of 
Europe, whereas most regions in Africa 
and Asia are optimistic about 


hesitancy at the French bio = science boosting job prospects. 
medical research institute IN- Just say ‘non’ The survey suggests France’s 
SERM in Marseille. When asked whether sluggish economy and relatively 


When asked whether vac- 
cines are safe, one-third of 
the 1000 French respondents 


vaccines are safe, a 
survey finds French people 


high unemployment as _ plau- 
sible causes. That fear about the 
future is understandable, says 


; disagree the most. . : iol 
to the survey disagreed—far COUNTRY DISAGREE Catherine Pélachaud, an artifi- 
more than in other nations. cial intelligence (AI) researcher 
(In the United States, 11% dis- France 33% at Sorbonne University in Paris. 
agreed.) The mistrust didn’t Gabon 26% “We see factories closing down 
vary much across age, gen- Togo 25% and it can be very hard for less- 
der, or education, according qualified people to adapt.” 
to the survey, conducted by Russia 24% It might be unfair to label 
the Gallup World Poll for the —_ Switzerland 22% France as averse to science based 
Wellcome Trust, a biomedical Armenia 21% on a single survey. There is plenty 
charity based in London. of enthusiasm for specific tech- 

Verger’s research points to Austria 21% nologies, according to a 2017 
high-profile health scandals Belgium 21% OpinionWay survey of 1059 peo- 
as an explanation. In 2009, Iceland 21% ple in France commissioned by 
for example, years after other Quattrocento, a scientific busi- 
countries, French regulators Burkina Faso 20% ness incubator in Paris. It found 
withdrew approval for Me- _ Haiti 20% that more than three-quarters of 
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Awoman gets vaccinated at a Paris hospital. 


respondents were hopeful about research 
in transport and renewable energies. At the 
same time, about two-thirds were worried 
about nuclear energy research and studies 
of genetically modified (GM) foods. The 
French government, a global leader in pro- 
ducing nuclear energy and exporting its 
technology, has largely ignored public dis- 
sent in that arena, but it has taken up public 
suspicions about GM foods by, for example, 
banning the cultivation of GM maize. 

Some French scientists are unsurprised by 
the survey results, and point out that opin- 
ions don’t always correlate to behavior. Of the 
French parents surveyed, 91% say their chil- 
dren are vaccinated, in line with the global 
average of 92%. “It’s the French paradox: We 
have doubts about many things; we grumble. 
But thankfully, vaccine coverage remains 
high,’ says Olivier Schwartz, scientific direc- 
tor of the Pasteur Institute in Paris, which 
depends on public donations to carry out 
its work, including vaccine research. “I don’t 
perceive a hostile climate,’ Schwartz adds. 
“On the contrary, I feel that [people in France 
have] a thirst for knowledge.” 

The survey data back this up: Some 71% of 
respondents in France say they know “some” 
or “a lot” about science—placing France in 
the top 10 globally. And 46% say they sought 
scientific information in the past month— 
compared with a median of 30% worldwide. 
Schwartz attributes the French vaccine skep- 
ticism to a lack of information, and says re- 
searchers and institutions need to fill that 
gap in a “simple and rigorous way.” 

But Brice Laurent, a sociologist at the 
Center for the Sociology of Innovation in 
Paris, warns against this “deficit model” of 
communication, which implies that igno- 
rance breeds skepticism and that people 
would embrace technologies if only they 
knew enough science. Studies show that 
more informed people are often more skep- 
tical, he notes. 

“Tt would be annoying if decision-makers 
saw this [survey] and thought, ‘French people 
just don’t get it, so we'll repeat the message,” 
Laurent cautions. He believes skepticism is 
ingrained in the culture of France, a place 
where all high schoolers are taught philoso- 
phy. “You could look at these stats and say: 
The French are critical thinkers; they are in- 
terested and ask questions,” he says. 

Pélachaud agrees that a vigilant public 
can help push scientists to consider societal 
impacts. “When I started 30 years ago, we 
didn’t ask ourselves ethical questions” in the 
AI field, says Pélachaud, who works on ani- 
mated chatbots capable of nonverbal com- 
munication. “The French may be grumblers, 
but their critical thinking is important.” & 
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STRUCTURAL BIOLOGY 


Mutant power resolves protein shapes 


Clever calculations on mutated proteins reveal spatial arrangement of amino acids 


By Robert F. Service 


apping the atomic structure of pro- 
teins is crucial to understanding 
how they behave, but it’s painstak- 
ing work that typically requires 
dedicated, expensive facilities with 
supercooled, powerful magnets 
or stadium-size synchrotrons. Now, two re- 
search teams independently report this week 
that they’ve hit on a way to use genetic and 
biochemical techniques to do the job, po- 
tentially opening structural biology to many 
more labs. And unlike traditional methods, 
which visualize proteins in crystals or solu- 
tion, the approach can also reveal proteins’ 
natural shape inside cells as they do their 
work, which could uncover how mutations 
that disrupt protein function lead to disease. 

“Tt’s. fantastic,’ says Douglas Fowler, a 
genome scientist at the University of Wash- 
ington in Seattle. Fowler notes the new ap- 
proach doesn’t offer the full atomic map that 
standard approaches do, but the general 
shape it provides for a protein nonetheless 
offers “extremely valuable” clues to its func- 
tion. Plus, he says, “It could have a really big 
impact” on efforts to determine structures of 
proteins that are membrane-bound or part 
of large complexes, both of which are diffi- 
cult to study with standard methods. 

Today’s most common approach for de- 
termining a protein’s structure requires 
coaxing millions of copies of a protein into 
an ordered crystal, blasting it with x-rays, 
and tracking their ricochets to reveal the 
identity and position of each atom. Alterna- 
tive approaches, including nuclear magnetic 
resonance spectroscopy and cryoelectron 
microscopy, also require large amounts of 
a protein and can take months. Researchers 
have unraveled the structures of only about 
150,000 of the millions of proteins thought 
to exist—and the process has taken decades. 

Some have tried to speed things up by 
predicting the most likely shape of a protein 
simply from its sequence of amino acids 
and probable interactions between atoms. 
The accuracy of such computational efforts 
typically lags behind experimental meth- 
ods. One recent trick to improve matters 
has been to compare the same protein in 
multiple species to find pairs of amino ac- 
ids that have evolved together even though 
they are far apart on the protein’s linear se- 
quence. That’s a strong indication the two 
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are close together and interact within the 
folded 3D molecule (Science, 22 July 2016, 
p. 338). But this approach works only if 
researchers can identify proteins that are 
shared by many organisms but different 
enough across them to identify multiple 
pairs of amino acids evolving in tandem, 
says Debora Marks, a systems biologist at 
Harvard Medical School in Boston. 

Now, separate teams led by Marks and 
Ben Lehner, a geneticist at the Barcelona In- 
stitute of Science and Technology in Spain, 
have bypassed the need for evolution’s help. 


Genetic tests have revealed protein structures (red) 
with nearly the accuracy of x-ray structures (gray). 


They independently hit on the idea of find- 
ing interacting amino acids within a pro- 
tein by systematically mutating each amino 
acid and tracking how the changes alter the 
protein’s function, such as an ability to bind 
to another molecule. Both groups built on 
work on a bacterial protein fragment called 
GB1 by a team led by Ren Sun, a systems 
biologist at the University of California, 
Los Angeles. In 2014, Sun’s team reported 
creating more than half a million copies of 
the GBI gene, each with one or two of its 
56 amino acids changed. For the so-called 
single mutants, the researchers systemati- 
cally swapped every amino acid for one of 
the 19 other options. In the double mutants, 
they changed pairs of amino acids, working 
through nearly all possible combinations. 
After growing bacteria with these mutant 
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genes and isolating the proteins, Sun’s team 
determined the importance of GBI’s amino 
acids by seeing which mutants bound most 
tightly to its natural target, human immuno- 
globulin G antibodies. 

Marks’s and Lehner’s groups realized they 
could combine the binding data of the sin- 
gle and double mutants to determine which 
amino acids interact most strongly and 
are therefore likely sit next to each other 
in the protein’s 3D structure. “Sometimes 
we see mutations that combine to have a 
much more dramatic effect,’ Fowler says. 
By tracking dozens of such occurrences 
and feeding the results into a structure pre- 
diction program, the teams computed the 
shape of GB1’s main backbone to within a 
few angstroms of the resolution of the al- 
ready known experimental x-ray structure. 

The teams, who report their success this 
week in Nature Genetics, also showed that 
their technique worked with other small pro- 
teins and an RNA with analogous available 
data. Fowler notes the same approach may be 
more difficult for proteins with hundreds or 
thousands of amino acids, because the num- 
ber of mutant proteins that must be made 
increases exponentially as the proteins grow. 
But Marks is optimistic: Early indications 
suggest the technique can solve structures 
with only a fraction of all possible mutants. 
The approach could even work using mea- 
sures of stability for proteins that lack known 
binding partners, Lehner’s team says. 

The method has other strengths. In 
March, Lehner and his team posted a pre- 
print on the bioRxiv server describing its 
use to learn how an RNA-binding protein 
called TDP-43 may cause the neurodegen- 
erative disease amyotrophic lateral sclero- 
sis (ALS). Insoluble aggregates of TDP-43 
have been seen in neurons in the autopsied 
brains of many ALS patients, but the depos- 
its may not cause the disease. So Lehner 
and his colleagues made 50,000 mutants of 
TDP-43 and tracked their toxicity in yeast 
cells. They found that mutant forms that ag- 
gregated were less toxic than other versions 
of the protein. “This is the exact opposite 
of what we expected,” Lehner says, caution- 
ing that they need to confirm this result 
in mammalian cells. Either way, he says, it 
shows that scanning mutations by the thou- 
sands may offer new insights into both pro- 
teins themselves and how their structures 
affect health in living cells. 
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LOST AT SEA 


A growing sensory smog threatens the ability of fish 
to communicate, navigate, and survive 


By Elizabeth Preston; J/lustrations by Valerie Altounian 


ry, for a moment, to be a fish. As 
you swim through dim waters, you 
see shapes moving past and watch 
for threats. You hear other ani- 
mals calling or producing rasps 
and crackles by scraping together 
rigid body parts. The water is a 
tapestry of smells that reveals 
predators and potential mates, 
food, and the route home. 

Now, imagine that nothing makes sense— 
the tapestry has unraveled. Smells 
still reach you, but their meanings 
are muddled. You listen for calls 
from your kin, but all you hear is the 
roar of a passing boat. You can’t tell 
whether that looming shadow is a 
friend or foe. 

When many people think of 
threats to the world’s fish, overfish- 
ing or vanishing reefs might leap to 
mind. Increasingly, however, scien- 
tists also worry about a subtler dan- 
ger: how human activities might 
interfere with the senses fish use 
to perceive the world. Noise from 
ships and construction, murkier 
waters caused by pollution, and 
rising ocean acidification from the 
buildup of atmospheric carbon dioxide 
(CO,) are all possible culprits. In laborato- 
ries and in the wild, scientists study exactly 
how those factors might affect a fish’s abil- 
ity to communicate, navigate, and survive. 

The studies face both logistical and con- 
ceptual challenges. Observing the behavior 
of fish in the vast sea is nearly impossible, 
but a laboratory aquarium is a far cry from 
their natural environment. And we can’t 
know exactly what seeing, smelling, or hear- 
ing as a fish is like. But by drawing on tools 
as elaborate as simulated underwater en- 
vironments and as simple as bits of thread 
tethering baby fish to stream bottoms, re- 
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searchers are gaining a better understand- 
ing of how fish use their senses—as well as 
the consequences of disrupting them. 
Driving the work is the concern that hu- 
mans are creating a pervasive threat to the 
fish stocks that provide food and livelihoods 
for millions of people: a kind of sensory 
smog. Combating causes such as pollution 
and acidification is a staggering challenge, 
the scientists admit. But the stakes are high. 
“The knock-on implications are huge,” says 


Jennifer Kelley, a behavioral ecologist at the 
University of Western Australia in Perth. 
When fish with compromised senses settle 
in the wrong homes or fail to recognize 
predators, the results could ripple outward 
“to how individuals interact and how com- 
munities operate, and the whole ecology of 
the system altering.” 


AN 11-DAY-OLD CLOWNFISH, pale orange and 
about as long as a grain of rice, searches for 
a place to settle down on a reef. Its keen 
sense of smell helps it both navigate to a 
safe home and steer away from the mouths 
of predators. 
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acidification 
can confuse how a 
clownfish reacts to predators. 


In the wild, clownfish inhabit living coral 
reefs. But in behavioral ecologist Danielle 
Dixson’s laboratory at the University of Del- 
aware in Lewes, the habitats beckoning the 
fish are made mostly of wires. Dixson will 
use the experimental setup to study how 
ocean acidification could alter how fish per- 
ceive and respond to their world. 

The laboratory—a converted garage with 
black bags taped over the windows to block 
bright light—holds two bays of fish tanks. A 
network of densely draped cables 
connects sensors in the aquariums 
to a black box on the wall, which 
researchers call “the brain.” It helps 
them monitor and control water 
temperatures and acidity levels in 
each tank. 

In some tanks, Dixson will keep 
clownfish and other species in sea- 
water with the acidity levels found 
now in the ocean. In other tanks, 
the water will be more acidic, to 
mimic the ocean chemistry that’s 
forecast for later this century if hu- 
mans do not curb CO, emissions. 
In both cases, conditions fluctuate 
during the day, as on a real reef. The 
researchers will look at how pH lev- 
els affect the way fish behave, interact, and 
respond to olfactory cues. 

Dixson is building on research showing 
that acidification can jumble a fish’s re- 
sponse to smells. For example, she and col- 
leagues reported in Ecology Letters in 2010 
that young clownfish raised in more acidic 
waters were strongly attracted to the smell 
of a predator, which the fish would normally 
avoid. In some predatory fish, acidification 
had the opposite effect. Dixson and co- 
authors found that after 5 days in wa- 
ter with high levels of CO,, sharks called 
smooth dogfish avoided the smell of their 
prey. Other researchers found a similar 
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result in the brown dottyback, a reef- 
dwelling predator. 

Acidification also seems to cause other be- 
havioral changes. In boldness tests—in which 
researchers approach fish with a blunt ob- 
ject to make them retreat—fish treated with 
acidic water come back out of hiding sooner. 
Those fish are like angry teenagers, Dixson 
says. “They’re really aggressive, but they re- 
ally don’t know what they’re doing.” 

The problem isn’t that acidified water 
damages fish noses, Dixson says. Instead, 
the issue is apparently in the brain. Multi- 
ple mechanisms may be at work. One strong 
possibility is that acidic conditions inter- 
fere with brain cell receptors that respond 
to y-aminobutyric acid (GABA), a neuro- 
transmitter. All vertebrates share GABA re- 
ceptors, which inhibit neuron activity. They 
are “like the brake on electrical circuits in 
the brain,” says biologist Géran Nilsson 
of the University of Oslo who, along with 
Dixson and others, first highlighted the 
potential connection between GABA recep- 
tors, fish behavior, and acidification in a 
2012 Nature Climate Change paper. 

Normally, when GABA binds to a recep- 
tor, it opens a channel that lets negatively 
charged chloride ions flow into the neu- 
ron. But that flow reverses in fish living 
in acidified water, studies suggest. That’s 
because, physiologically, the fish cope with 
the more acidic environment by accumulat- 
ing bicarbonate ions, which are basic, and 
by shedding chloride ions. When the chan- 
nel is opened, chloride ions flow out of the 
neuron instead of into it, and the neuronal 
“prake actually becomes an accelerator,’ 
Nilsson says. That hypothesis has drawn 
support from studies in which researchers 
dose fish with gabazine, a drug that sup- 
presses GABA receptors. After a quick dip 
in gabazine-laced water, formerly confused 
fish act normal again. 

Acidification similarly affects fish vision 
and hearing, perhaps also by scrambling 
GABA receptors. In one study, researchers 
treated young damselfish with high-CO, 
water and then placed a plastic bag hold- 
ing a predatory fish in their tank—so the 
damselfish could see the predator, but not 
smell it. Normally, that apparition would 
make the damselfish lie low. Instead, fish 
from acidified water ignored or swam close 
to the bag. In another study, researchers 
used an underwater speaker to play young 
clownfish a recording of a coral reef teem- 
ing with predators. Clownfish raised in 
high-CO, water didn’t flee from the sound, 
as they normally would. 

Other researchers have found that oto- 
liths, the little chunks of calcium carbonate 
in the inner ears of vertebrates, are larger 
than normal in some (but not all) of the 
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Cod use sound to 
communicate, but noisier seas 
are interfering with their conversations. 


fish species they raised in acidified waters. 
Changes to otolith size could affect fish 
hearing and orientation, researchers say. 
Different fish species are vulnerable to 
different degrees of ocean acidification. In 
one study, researchers found that condi- 
tions mimicking an atmospheric CO, level of 
700 parts per million (ppm) addled about 
half of clownfish; all the fish were affected 
at 850 ppm. At current emission rates, fish 
populations could experience those levels 
well before the end of the century, leaving lit- 
tle time to adapt. “It doesn’t seem like there’s 
a lot of wiggle room for evolution or natural 
selection to take place,’ Dixson says. 


IT’S EARLY SPRING in the Atlantic Ocean, 
and an adult cod is journeying back to his 
preferred spawning grounds to find a mate. 
He grunts while he swims. He and the other 
migrating cod add their grunts to a marine 
cacophony: the barks, mutters, clucks, and 
chirps of other fish; the singing of whales; 
the steady munching of sea urchins below. 
Light rain above tinkles musically, reaching 
a crescendo when heavier storms pass. The 
noises reverberate in the cod’s belly, where 
his balloonlike swim bladder acts as an ex- 
tension of his ears. 

What cod or other migrating fish are say- 
ing as they travel is not clear. They may be 
calling to stragglers, synchronizing migra- 
tion or spawning, or establishing domi- 
nance. But clearly, the fish increasingly must 
compete with human noise sources such as 
recreational boats, commercial ships, pile 
driving, sonar, and deep-sea mining. 

Marine ecologist Jenni Stanley at the 
National Oceanic and Atmospheric Ad- 
ministration’s Northeast Fisheries Science 
Center and the Woods Hole Oceanographic 
Institution in Massachusetts has mea- 
sured the impact of the added noise. 
She and colleagues recorded ambient 
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underwater sounds at spawning sites for 
cod and haddock in Massachusetts Bay. Un- 
like the grunting cod, haddock produce an 
insistent knocking sound that can go on for 
hours. At times with less competing noise 
from vessels, the fish calls could carry more 
than 20 meters, the researchers reported 
in 2017 in Scientific Reports. But at noisier 
times, the calls carried only a meter or so 
before being drowned out. 

Researchers don’t know whether such 
interference has affected overall popula- 
tions of cod and haddock. But scientists 
worry about subtle harms that could ulti- 
mately take a toll. “I think the critical issue 
.. is the impact on the soundscape,” says 
biologist Arthur Popper of the University of 
Maryland in College Park. At least 800 fish 
species make sounds. But “even those that 
don’t communicate with sound are still lis- 
tening to their environment,” Popper says. 
They use sound to identify predators, for 
example, or suitable habitat. 

A noisier world could therefore have 
“potentially dire consequences” for fish, 
warned the authors of a 2018 meta-analysis, 
published in Global Change Biology, which 
examined 42 pertinent studies. Biologist 
Kieran Cox of the University of Victoria in 
Canada and co-authors found that noise 
can hurt fishes’ ability to find food, increase 
their risk of being eaten, and reduce their 
reproductive success. 

Just how well many fish can hear in the 
first place is uncertain. “What we don’t 
know is so gigantic,” Popper says. Studying 
fish behavior in the ocean over long periods 
is challenging. But laboratory studies have 
their own limitations, Popper points out: 
Aquariums can change how sound waves 
travel, and confined fish may not behave as 
they would in the wild. 

Scientists do know that fish are vulner- 
able to loud events. Popper’s laboratory, for 
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example, used a large contraption to mimic 
the sounds created by a pile being ham- 
mered into the sea floor. The researchers 
found that as the pounding sound passes 
through a fish’s body, the swim bladder 
can knock into other tissues hard enough 
to cause serious injuries. In the wild, fish 
deaths have been observed near areas of 
pile driving. Fish in the open ocean can of- 
ten flee far from loud noises, Popper notes, 
but those in more constrained freshwater 
lakes and rivers likely can’t escape the din. 


THE BELLY OF A MALE three-spined stickle- 
back, normally dull in color but stained 
red-orange for the breeding season, blushes 
even more deeply as he swims toward a fe- 
male. He faces her and repeats a zigzagging 
dance, darting left and right. If he’s lucky, 
the female follows him to his nest in 
the sand and lays her eggs. 

“Theyre quite cute when they 
perform their courtship behavior!” 
says ecologist Ulrika Candolin of 
the University of Helsinki. Stickle- 
backs are widespread in the ocean 
and in lakes; the population she 
studies lives in the slightly salty 
Baltic Sea. In recent years, she says, 
nitrogen and phosphorus pollution 
has fertilized algal blooms so severe 
that, on warm days, they turn the 
Baltic’s water green. 

In waters thick with algae, both 
laboratory and field studies have 
shown that romance-minded stick- 
lebacks “have more difficulties de- 
tecting each other,’ Candolin says. 
When a pair does connect, the male 
spends longer than usual performing his 
display. And the female spends more time 
inspecting the male, as though unsure of 
what she’s seeing. 

Such visual interference can harm sexual 
selection, Candolin and colleagues found. 
Stickleback mating becomes more ran- 
dom in murky water, with females hav- 
ing a greater chance of selecting a less fit 
male, they reported in a 2016 Ecology pa- 
per. The consequences include having fewer 
surviving offspring. (Candolin notes that 
smell might also be a factor in poor mate 
choices—the algal blooms may obscure 
scent information that female sticklebacks 
would normally use to judge males.) 

So far, Candolin notes, the sloppier mat- 
ing caused by murky waters isn’t hurting 
stickleback populations in the Baltic. Their 
numbers are actually growing; a decline in 
predators, also due to ecosystem changes, 
might be a factor. But Candolin’s work has 
helped highlight how visual pollution might 
take a toll in the future. “Females are wast- 
ing their time and energy” when they select 
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suboptimal mates, says Grant Brown, an 
ecologist at Concordia University in Mon- 
treal, Canada, who says Candolin’s work on 
mate choice is “very nicely done.” 

Another risk of not clearly seeing your 
partner is that you will mate with the 
wrong species. Africa’s Lake Victoria, for 
example, hosts hundreds of species of fish 
called cichlids. Females choose mates on 
the basis of their distinctive bright col- 
ors. But the number of cichlid species has 
declined, researchers found in the 1990s. 
Meanwhile, runoff from deforestation and 
agriculture has fertilized and clouded the 
lake waters, causing females to struggle to 
recognize males from their own species. 
They seem to end up mating with other 
species, resulting in hybrid offspring that 
sport duller hues. Not only could hybrid- 


ization take a toll on species diversity, 
Brown says, but the mismatched mates 
may produce offspring that can’t compete 
or reproduce, harming populations. 
Polluted water may obscure other crucial 
signals. Brown, for example, has found in the 
lab that chemical pollutants appear to mask 
important molecules that fish use to sniff 
out danger. Fish release those molecules in 
their urine when disturbed; the odors warn 
other fish that a predator may be nearby. 
But high levels of nitrogenous pollutants, 
such as fertilizer runoff in streams, can 
overpower the warning scents. Brown says 
detecting the signal becomes “like trying to 
hear someone in a very crowded room with 
lots and lots of background noise.” 
Pollution can also chemically alter a dif- 
ferent warning signal, he found: molecules 
that leak from fish skin when damaged, for 
example, by a predator’s attack. Freshwa- 
ter acidification, caused not by rising CO, 
levels, but by airborne sulfate and nitrate 
pollution, may render those molecules un- 
recognizable. To better understand the pos- 
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Murky waters can 
complicate stickleback mating. 


sible consequences, Brown and co-author 
Chris Elvidge, now of Carleton University in 
Ottawa, used meter-long threads to tether 
baby Atlantic salmon to stream bottoms. 
Normally, the young fish would respond 
to the smell of other fishes’ alarm cues by 
dropping to shelter in the gravel. But when 
the researchers returned 6 hours later, the 
fish in more acidic streams, which were 
less able to detect those cues, were like- 
lier to have vanished, apparently because 
they didn’t duck from danger. Sometimes a 
full-bellied trout had taken the place of the 
young salmon on the end of the thread. 


RESEARCHERS ARE NOT only document- 
ing how sensory smog harms fish, but also 
searching for ways to protect them from it. 
Some marine reserves, for example, already 
ban certain activities, such as con- 
struction or exploration for oil. But 
as researchers learn more about 
aquatic soundscapes, they can 
imagine other ways of protecting 
fish from unwanted sound, such as 
by barring loud motors or industrial 
activities from key fish spawning 
grounds during certain seasons. 

Larger global issues, such as 
ocean acidification and polluted 
runoff, could be harder to tackle. 
Even here, however, “There are 
rays of hope,” Brown says. For in- 
stance, he found that fish can as- 
sociate a given smell not only with 
risk, but with a certain habitat or 
time of day. If a predator hunts 
only at dawn or dusk, the prey fish 
may learn to ignore the smell of 
that predator at midday. In a world where 
some of their sensory cues are masked or 
changed, fish may find new cues and learn 
new rules to live by. 

Fish may also be able to cope with some 
interference because they have multiple 
ways of sensing the world. For example, 
fish have lateral lines, a set of organs on the 
outside of the body that sense pressure and 
currents and help fish orient themselves. 
Some species can also sense magnetic 
fields. Sharks and their relatives can de- 
tect electricity. “In some cases, if one sense 
is blocked or unusable, then fish will just 
switch to another one,” Kelley says. “That 
makes them very resilient in that context.” 

The rapid pace of environmental change, 
however, is testing the sensory resilience of 
fish as never before. The outcome of rising 
sensory smog “could be not as bad as we’re 
anticipating,’ Dixson says. “Or it could be 
1000 times worse.” & 


Elizabeth Preston is a journalist in 
Arlington, Massachusetts. 
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Do tiny fish rule the reefs? 


Covert fish larvae may serve as crucial cuisine in coral reef ecosystems 


By Cynthia Riginos' and Jeffrey M. Leis?* 


oral reef fishes are famous for their 
fantastic colors and forms. Easily 
overlooked are the cryptic and dimin- 
utive (cryptobenthic) bottom-dwell- 
ing fishes that also call coral reefs 
home. By linking empirical data and 
ecosystem modeling, Brandl et al. (1), on 
page 1189 of this issue, propose that pelagic 
(open-water) larvae of cryptobenthic fishes 
and their small juveniles that recently set- 
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tled on reefs constitute a key food source for 
other reef residents. Such a scenario could 
help explain why coral reefs in nutrient- 
poor waters teem with life. 

Small fishes live fast and die young (2). 
Although each fish might only spawn a few 
eggs at a time, Brandl et al. argue that the 
sheer number of spawning fishes at any 
time could provide a near-constant supply 
of pelagic larvae (see the figure). Pelagic lar- 
vae from cryptobenthic reef fishes (CRFs) 
tend to remain near reefs and lagoons (3, 4) 
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and thus are ready prey for reef residents. 
By contrast, large fishes live long and ma- 
ture slowly. Although they produce more 
eggs per spawn than small fishes (5), large 
fishes are less numerous and tend to spawn 
seasonally, so the total number of pelagic 
larvae produced is smaller and these lar- 
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vae develop offshore more often relative to 
CRFs. The net effect of these contrasting 
life histories, Brandl et al. suggest, is that 
larvae of CRFs greatly outnumber other 
larval fishes in near-reef waters and thus 
feed the “wall of mouths” of reef-dwelling 
planktivores. The abundant CRF juveniles 
and short-lived small adults also provide 
nutrient inputs to the reef system. This 
new demographic model suggests that the 
contributions of CRFs to nutrient cycling in 
coral reef ecosystems are more important 
than previously appreciated (6). 

The notion that substantial proportions of 
nutrients are recycled within a reef system 
by way of CRFs, if confirmed, represents an 
important advance toward understanding 
how coral reefs can be so productive. Coral 
reefs are surrounded by oceanic “deserts” 
of low-nutrient waters, and thus their high 
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Distinct life histories shape coral reefs 

CRFs produce few eggs, but because adults are abundant, pelagic larvae and postsettlement juveniles are 
plentiful and might constitute a generous food source. Large reef fishes each spawn many eggs, but there are 
fewer adult fishes. Thus, the amount of larvae is lower and provides less of a food resource. 


Postsettlement 
juveniles 
develop 

quickly. 


productivity and biodiversity have long per- 
plexed scientists. If CRFs are instrumental 
for local nutrient retention, then they would 
constitute a crucial guild of species within 
reef ecosystems. In turn, if CRFs fulfill a key 
ecological role, then the ecological and evo- 
lutionary dynamics that ensure their local 
persistence—especially the extent to which 
young-adult fishes return to their parents’ 
reef location or disperse to other neigh- 
borhoods—are highly consequential to the 
health of coral reef ecosystems. 

Although the idea that CRFs perform crit- 
ical ecosystem services is compelling, defin- 
itive demonstration of such a seminal role 
is challenging. In particular, the pelagic en- 
vironment in which larvae develop remains 
poorly understood; thus, much of the infor- 
mation about fish life histories and species 
interactions necessary to reconstruct demo- 
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Postsettlement 
~ juveniles develop 


Cryptobenthic reef fish (CRF) 
CRF larvae develop in near-reef waters. 


Many non-CRF larvae develop in 
open water. Relatively fewer 
larvae settle in coral reefs and are 
small in size compared to adults. 


Large reef fish 


Larvae 
Life span 
~3 to 60+ years 


Many eggs released 
into water column 


graphic dynamics is unknown. Because of 
these many uncertainties, Brand] et al. were 
understandably forced to estimate most 
life-history parameters and make simplify- 
ing assumptions. 

For example, few key life-history param- 
eters—such as fecundity, mortality, settlement 
size, and growth rate—are well understood at 
the species level, which required the authors 
to estimate such parameters at the family or 
order level. Such inferences based on higher 
taxonomic levels might obscure CRF charac- 
teristics. Brand] et al. define CRF families as 
having at least 10% of species smaller than 5 
cm, but in only four of the 17 known CRF fam- 
ilies are >50% of species known to be smaller 
than 5 cm (2). Thus, most trait estimates were 
derived from species that do not match the 
CRF criterion set by Brandl et al. For instance, 
cardinalfishes (Apogonidae) are often neither 
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small (80% of species are >5 cm) nor cryptic, 
with many species conspicuously schooling 
in the water column in daytime. Determining 
life-history attributes for more species would 
enable future updates of demographic eco- 
system models based on species-level char- 
acteristics, thereby increasing precision and 
confidence in results. 

A second simplification reflects the pau- 
city of information regarding near-reef 
plankton composition. Because of biases 
in sampling methods (7) and challenges in 
identifying animal larvae, studies typically 
focus on selected taxonomic groups. For ex- 
ample, invertebrates are the most numerous 
constituents of zooplankton in reef waters, 
yet typical inventories focused on fishes do 
not report invertebrate abundances (3, 4). 
To aggregate larval fish data across dispa- 
rate studies, Brandl et al. focused on famil- 
ial proportions of reef fishes rather than 
abundances, which obscures important 
aspects of nutrient cycling, especially inver- 
tebrate contributions to reef residents’ nu- 
trition. Comprehensive plankton-sampling 
regimens combined with species-level iden- 
tifications of larvae would yield improved 
estimates of nutrient exchange in reefs 
and also show how larval retention differs 
among species. 

Also unknown is the extent to which 
CRF larvae and juveniles are consumed by 
other fishes and invertebrates. It is possible 
that conventional approaches for examin- 
ing the contents of fish guts underestimate 
CRF consumption (7). DNA sequencing data 
show promise for evaluating the full com- 
plexity of reef fish demographics and nutri- 
ent cycling (8, 9). 

Against the backdrop of pervasive coral 
reef bleaching lies an urgent need to ac- 
curately understand and quantify coral 
reef biodiversity. The study of Brand] e¢ al. 
underscores our ignorance regarding the 
lives of larvae and their participation in 
coral reef ecosystem dynamics. Collecting 
life-history information at a species level 
can seem mundane, but such fundamental 
investigations with standardized sampling 
methods are crucial to the design of biologi- 
cally realistic models. 
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Ruminants: Evolutionary past 
and future impact 


The genomes of ruminant animals promise advances for 
agriculture, conservation, and biomedicine 


By Dai Fei Elmer Ker’? 
and Yunzhi Peter Yang*** 


uminants are mammals of consid- 

erable agricultural, conservational, 

and biomedical importance. The 

approximately 200 extant species of 

this clade include traditional live- 

stock such as cattle and sheep, en- 
dangered species such as Pére David’s deer 
or milu, and species of biomedical interest, 
such as antlered deer (J-8). Ruminants are 
extremely diverse but share in common a 
multichambered stomach specialized for 
digesting tough plant fibers through micro- 
bial-aided fermentation (/, 4). Collectively, 
ruminants are highly successful, having col- 
onized multiple terrestrial environments, 
including the unforgiving conditions of the 
Arctic tundra (3, 4). On pages 1152, 1153, 
and 1154 of this issue, Chen e¢ al. (1), Wang 
et al. (2), and Lin et al. (3), respectively, se- 
quenced the genomes of multiple ruminant 
species, offering resources and analyses for 
understanding their evolution and diver- 
sity. The findings provide vital insights into 
genetic adaptations that are responsible for 
their biological success, as well as how they 
have been affected by human activity. 

Chen et al. assembled the genomes of 44: 
ruminant species from all six Ruminantia 
families (Tragulidae, Antilocapridae, Giraf- 
fidae, Cervidae, Moschidae, and Bovidae), 
This substantially adds to the library of 
previously sequenced ruminant genomes, 
including cattle, yak, goat, sheep, Masai gi- 
raffe, okapi, and red deer. With their dataset, 
Chen et al. analyzed Ruminantia evolution- 
ary divergences. Their methodology used 
the distantly related killer whale as an out- 
group, the well-annotated goat genome as a 
reference for orthologous syntenic sequences 
(evolutionarily related genes colocalized on 
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the same chromosome), and the fossil record 
as a means to calibrate molecular rates of 
evolution, producing a whole-genome phylo- 
genetic tree for 51 ruminant species. The evo- 
lutionary insights gained include identifying 
a closer (sister-group) relationship between 
Antilocapridae and Giraffidae. Previous mi- 
tochondrial DNA-based phylogenies placed 
Antilocapridae as an outgroup relative to 
other pecorans. Additionally, they confirmed 
Moschidae as a sister group of Bovidae and 
better refined Bovidae subfamily relations 
and species positions (1). These findings pro- 
vide strong evidence for adjudicating the po- 
sitions of ambiguous taxa. 

Chen et al’s dataset also provides a win- 
dow to the past. Using methods to infer 
ruminant demographics during the Pleis- 
tocene (~2.58 million to 12 thousand years 
ago), they found that major population 
declines coincided with increased human 
(effective) population size. This implies 
that humans may have contributed to ru- 
minant declines during the late Pleistocene 
and holds important conservation lessons. 
When analyzed with nonruminant genomes 
and ruminant transcriptomic (gene ex- 
pression) data, genome evolution could be 
deciphered. Such analyses identified fast- 
evolving genes, positively selected genes, 
and newly evolved genes that may have 
important roles in ruminant trait evolution. 

Wang et al. studied the evolution of rumi- 
nant headgear (antlers, horns, ossicones, and 
pronghorns), which has implications for re- 
generative biology and potentially for cancer 
biology. These cranial appendages serve di- 
verse functions that include mate selection, 
feeding, and sensing. Ruminant headgear 
share similarities in that they comprise a 
bony core covered by integument (skin and/ 
or keratinized sheath) and are located on the 
frontal cranial bones. However, these head- 
gear are unique in that four of the ruminant 
families—Cervidae, Bovidae, Giraffidae, and 
Antilocapridae—each possess a_ different 
type—antlers, horns, ossicones, or prong- 
horns, respectively (see the figure). In par- 
ticular, antlers are a research focus for tissue 
engineering and regenerative medicine be- 
cause they are deciduous bony structures 
that typically regenerate rapidly. Indeed, 
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antlers can grow as much as 2.5 cm per day 
and produce 10 kg of osseous (bone) tissue 
within a few months (4, 6, 7). Although this 
phenomenon was noted in Aristotle’s His- 
tory of Animals more than 2000 years ago 
(9), the underlying molecular mechanism(s) 
are only just beginning to be understood. 

By leveraging the data of Chen e¢ al., 
Wang et al. found that the most parsimoni- 
ous explanation for headgear evolution is 
that this trait evolved once and was inde- 
pendently lost in Tragulidae and Moschi- 
dae. Comparative analysis of 68 goat, 162 
sheep, 20 roe deer, and 20 sika deer tran- 
scriptomes sourced from 16 tissues identi- 
fied 201 common headgear-specific genes 
as well as 624 horn-specific and 761 antler- 
specific genes. Additional analysis iden- 


cancer in cervids, while also promoting 
rapid antler growth. 

Lin et al. examined molecular adapta- 
tions that enable reindeer to thrive in harsh 
Arctic and subarctic conditions, including 
severe cold, limited food availability, and 
prolonged periods of light or darkness. 
Reindeer have developed several distinct 
features, including specialized fat metabo- 
lism to limit heat loss, little-to-no circadian 
rhythm (daily cycle) to adapt to extreme 
light periodicity, and efficient vitamin D 
metabolism to facilitate rapid antler growth 
in the presence of low light. In addition, 
reindeer are unique within Cervidae in two 
aspects: They are the only species in which 
females possess antlers and the only fully 
domesticated species. 


Geographic distribution of endangered ruminants 

Ruminants are highly successful, having colonized multiple terrestrial environments, but they remain 
threatened or endangered. This diverse group comprises six families, four of which have different types 
of headgear. The map shows distributions for threatened or endangered ruminant species included in the 
International Union for Conservation of Nature Red List of Threatened Species (16). 


Cervidae 
(Antlers) 


Bovidae 
(Horns) 


Antilocapridae 
(Pronghorns) 


tified highly conserved gene regulatory 
elements implicated in stem cell pluripo- 
tency and the transforming growth factor- 
8 (TGF-B) signaling pathway, indicating 
that a single mechanism—neural crest- 
mediated bone differentiation—may be in- 
volved. This provides further support that 
ruminant headgear may have originated 
once. Further bioinformatics analysis re- 
vealed that oncogenesis- and neurogene- 
sis-associated pathways may mediate rapid 
antler regeneration. In particular, high ex- 
pression of tumor suppressors, including 
promyelocytic leukemia protein (PML), 
was identified as being under positive se- 
lection. This may explain the low rate of 
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Using comparative genomics analyses 
with ruminant and nonruminant species, 
Lin et al. identified mutations that facilitate 
the Arctic adaptation and domestication of 
reindeer. These include mutations in genes 
and regulatory sequences that presumably 
enhanced vitamin D metabolism, modified 
fat production and transport, facilitated 
female antler growth in the presence of 
low androgen levels, disrupted circadian 
rhythm, and enhanced docility. Therefore, 
the study of Lin et al. reveals a genetic basis 
for the evolutionary success of reindeer. 

The genomic resources and evolution- 
ary insights derived from these studies are 
likely to contribute to ruminant agriculture 
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and conservation. These contributions may 
include improving agricultural economics 
and food security; reducing disease trans- 
mission; and minimizing human environ- 
mental impact, for example, by identifying 
and monitoring endangered species (JO, 11). 

Additionally, studies of deer antlers offer 
attractive approaches for tissue engineering 
and regenerative medicine. For instance, 
deer antlers have inspired a commercially 
promising prosthesis for amputees (8). This 
prosthesis mimics the soft-to-hard (skin-to- 
bone) tissue interface of deer antlers, allow- 
ing secure intraosseous and transcutaneous 
attachment while minimizing infection. 
During antler regeneration, open wounds 
form on top of a deer’s pedicles (the cranial 
structures from which antlers regenerate), 
which is quickly followed by scab forma- 
tion, velvet skin transformation (which 
is crucial to antler formation), and antler 
morphogenesis. The ability of cervids to 
rapidly generate large quantities of inner- 
vated bone with low tumor and infection 
incidence suggests that studying antlero- 
genesis may offer insights for addressing 
large skeletal defects, achieving infection 
control, regenerating nerves, and possibly 
even limiting cancer growth (4-7). 

Typically, regenerative biology is studied 
using small organisms such as hydras (12), liz- 
ards (13), naked mole-rats (14), and salaman- 
ders (15). Studying large mammals such as 
deer is logistically and ethically challenging 
(4). Although such challenges can be avoided 
with in vitro approaches to identify differen- 
tially expressed genes of interest (6), the lack 
of genomic resources and their accompany- 
ing insights presents tremendous research 
barriers. Thus, these three studies (J-3) are 
extremely valuable. Future work that vali- 
dates the function(s) of identified genes will 
likely have major impacts on society. 
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Miding in plain sight 


A common biomarker of pancreatic disease has a functional role in pathogenesis 


By Christopher J. Halbrook! and 
Howard C. Crawford?* 


ancreatic cancer remains one of the 

deadliest human malignancies. This 

is in part due to the lack of a reli- 

able method of early detection, be- 

cause late-stage disease is largely 

refractory to treatment. Biomarkers 
for early disease detection have remained 
elusive. However, the glycan carbohydrate 
antigen 19-9 (CA19-9), which is produced 
by pancreatic cancer cells, is increased in 
the serum of most patients 
(1). It is clinically useful as a 
biomarker of tumor burden 
during treatment, rather than 
for early detection, because se- 
rum CA19-9 is also increased 
in other diseases, including 
inflammation of the pancreas 
(pancreatitis), a risk factor for 
the development of pancre- 
atic cancer. To date, there has 
been little understanding of 
CA19-9 function in pancreatic 
pathophysiology. On page 1156 
of this issue, Engle et al. (2) re- 
port that CA19-9 drives the de- 
velopment of pancreatitis and 
accelerates pancreatic tumor 
progression. 

CA19-9, also known as sialy]- 
Lewis’, is a glycan moiety that 
modifies proteins, lipids, and 
other glycans. In humans, the 
expression of CA19-9 is depen- 
dent on Lewis (Le) blood type 
(3), a human blood group sys- 
tem based on the expression of 
two fucosyltransferase (FUT) 
enzymes, FUT2 and FUT3. Individuals 
with the blood type Le(a-b-) do not gener- 
ate CA19-9 because they lack FUT3; mice 
also do not express this enzyme and there- 
fore cannot produce CA19-9. Functionally, 
CA19-9 can promote cancer cell metasta- 
sis by binding to the adhesion protein E- 
selectin that is expressed on the surface 
of endothelial and mesothelial cells (4). 
This binding presumably enhances migra- 
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tion of metastatic cells through these cell 
layers. Additionally, CA19-9 expression in 
colon cancer cells replaces the structur- 
ally related disialyl Lewis* expressed by 
normal colon epithelia (5). Disialyl Lewis* 
produced by normal epithelium serves an 
immune inhibitory role by binding sialic 
acid-binding immunoglobulin-like lectins 
(siglecs) on resident macrophages, sup- 
pressing the expression of proinflamma- 
tory cyclooxygenase-2 (6). Accordingly, it 
is thought that the replacement of disialyl 
Lewis* by CA19-9 during carcinogenesis re- 


Signaling underlying pancreatic disease 
Posttranslational modifications by CA19-9 can confer new functions to proteins. 
CA19-9:fibulin-3 can act as an EGFR ligand, thereby activating the downstream 

RAS-RAF-MEK-ERK signaling cascade. 
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lieves this immune suppression through its 
failure to engage siglecs, thereby promot- 
ing tumor progression. Apart from these 
generic functions, a pathophysiological 
function induced by CA19-9 modification 
of a specific protein has not been reported 
until now. 

Pancreatic cancer cells nearly univer- 
sally contain activating mutations in the 
KRAS oncogene. Expression of epidermal 
growth factor receptor (EGFR) is required 
for induction of Kras-driven tumorigenesis 
in the pancreas of mice (7, 8) by increasing 
total RAS activity above a minimal thresh- 
old that is required for transformation. 
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Concomitantly, the EGFR-RAS-mitogen- 
activated protein kinase (MAPK) signaling 
axis in pancreatic epithelium increases the 
production of proinflammatory cytokines 
(9). These cytokines recruit immune cells, 
which are key drivers of pancreatitis and 
pancreatic tumor development and main- 
tenance (J0-12). The numerous reactive 
inflammatory and stromal cells are in turn 
a rich source of EGFR ligands (13, 14), in- 
voking the involvement of a vicious cycle 
of chronic EGFR signaling and inflamma- 
tion in benign pancreatic disease. In later 
stages of pancreatic cancer, 
EGFR is largely dispensable 
(7), and inhibiting it in pancre- 
atic cancer patients provides 
minimal benefit (75). 

Using genetic engineering, 
Engle et al. created a mouse 
model that expresses both 
FUT3 and the galactosyltrans- 
ferase B3GALT5, allowing the 
production of CA19-9. They find 
that CA19-9 synthesis promotes 
modification of the extracel- 
lular matrix protein fibulin-3, 
producing CA19-9:fibulin-3. 
This activates pancreatic EGFR 
signaling, inducing a pathology 
that is similar to pancreatitis, 
which is reversible either by 
shutting down the expression 
of the CA19-9 synthesis ma- 
chinery or by targeting CA19-9 
itself (see the figure). Thus, in- 
hibiting this newly described 
function of CA19-9 presents an 
attractive potential target for 
the treatment of early-stage 
pancreatic cancer and pancre- 
atitis patients. 

The results of Engle et al. suggest that 
the canonical EGFR ligands produced by 
the pancreatic epithelium or the inflam- 
matory and stromal cells (7, 12) are mini- 
mally involved in maintaining the disease 
state. One potential explanation for this 
is that CA19-9:fibulin-3 engages and ac- 
tivates EGFR with distinctive kinetics 
compared to other ligands. Alternatively, 
CA19-9:fibulin-3 is necessary but not suf- 
ficient to maintain the phenotype without 
collaborating with other CA19-9-modified 
molecules. This latter possibility seems 
more likely because, as shown by Engle 
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et al., blocking CA19-9 is more effective at 
reversing the pancreatitis phenotype than 
inhibiting EGFR itself, which causes severe 
pancreatic atrophy (wasting). 

Although the discovery of a new func- 
tion of CA19-9 is exciting, Engle e¢ al. 
have also potentially revitalized a role for 
CA19-9 in the early detection of pancreatic 
cancer. A major issue for many putative 
early detection markers, including CA19-9, 
has been the limited ability to distinguish 
between chronic pancreatitis and pancre- 
atic cancer. With their transgenic mouse 
model and the organoids (cells grown as 
three-dimensional cultures) derived from 
them, they have generated an optimal 
system with which to identify CA19-9- 
modified proteins that can distinguish the 
metaplastic epithelia found in pancreatitis 
from the cancerous epithelia of adenocar- 
cinoma. Finding specific cancer-associated 
CA19-9-modified proteins in the circula- 
tion could greatly enhance the specificity 
of a biomarker compared to either CA19-9 
or the unmodified protein alone. 

To date, there are still no effective thera- 
pies or early detection methods for pancre- 
atic cancer. This is arguably because of a lack 
of understanding of the biology underlying 
the disease, a void illustrated by the study of 
Engle et al., which ascribes a critical function 
to the most commonly used biomarker of the 
disease, first identified over 30 years ago (J). 
The translational potential of the findings of 
Engle et al. also reinforce the value of study- 
ing the events that precede and promote 
the formation of pancreatic cancer. These 
observations lend much needed insight into 
the understanding of pancreatic cancer and 
can inform new treatment strategies for the 
advanced stages in which it is usually diag- 
nosed in patients. 
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The gut microbiota 
and colon cancer 


Microbiome data should be incorporated into the 
prevention, diagnosis, and treatment of colon cancer 


By Wendy S. Garrett 


he human microbiota is the collec- 

tion of microorganisms—bacteria, 

archaea, viruses, fungi, protozoa, and 

helminths—that populate the human 

body. They are emerging as an impor- 

tant feature of human health and dis- 
ease. Currently, access to the genomic data 
of human cells and of microbiota (micro- 
biomes) is more affordable and accessible 
than ever before. A major challenge is to 
unravel how we integrate microbiome data 
into precision medicine approaches for the 
prevention, diagnosis, and treatment of dis- 
eases such as cancer. The gastrointestinal 
(GD) tract is densely populated with micro- 
organisms. Colorectal cancer (CRC) is the 
third most prevalent cancer worldwide. It is 
increasing in individuals less than 50 years 
old and is associated with specific dietary 
factors and eating patterns that affect the 
gut microbiota. Therefore, CRC seems ripe 
for microbiome-based prevention, diagnos- 
tics, and therapeutics. 

Over the past 10 years, several species of 
bacteria have garnered attention for their 
associations with, and potential roles in, 
colorectal carcinogenesis (see the figure). 
Some, such as Fusobacterium nucleatum, 
initially emerged from sequencing-based 
studies of human CRC samples (J, 2), and 
mechanistic data from preclinical mod- 
els have since demonstrated a panoply of 
roles for F nucleatum in several aspects 
of CRC biology. In vitro, F: nucleatum pro- 
motes CRC cell proliferation; in mice, the 
presence of F nucleatum in patient-de- 
rived CRC xenografts (where patient CRC 
samples are implanted in mice) increases 
tumor growth rates (3). A possible mecha- 
nism to explain these findings is that the 
F. nucleatum adhesin, FadA, binds to E- 
cadherin on the CRC cell surface and ac- 
tivates oncogenic Wnt/B-catenin signaling 
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(4). F. nucleatum can also alter the func- 
tion of tumor-infiltrating lymphocytes and 
natural killer (NK) cells by binding to the 
inhibitory immune receptor TIGIT (T cell 
immunoreceptor with Ig and ITIM do- 
mains) through another adhesin, Fap2 (5). 
Fap2 also binds a disaccharide sugar motif 
[galactose N-acetyl-p-galactosamine (Gal- 
GalNAc)], which is expressed at high levels 
on the surface of many tumor cell types, 
as well as other cell types, and facilitates 
F. nucleatum binding to CRC cells and en- 
richment in CRC (5). 

The mechanisms by which F. nuclea- 
tum affects carcinogenesis also extend to 
facilitating resistance to chemotherapy. 
In preclinical models of CRC with high 
tumor loads of F nucleatum, tumors are 
more resistant to the commonly used CRC 
chemotherapeutic, oxaliplatin. / nuclea- 
tum activates autophagy (a cellular recy- 
cling process that influences cell survival) 
through Toll-like receptor 4 (TLR4) ex- 
pressed on CRC cells, rendering these tu- 
mors more resistant to oxaliplatin-induced 
cell death (5). 

Other bacteria, such as enterotoxigenic 
Bacteroides fragilis (ETBF), are long- 
studied human GI pathogens that cause 
diarrhea and GI inflammation. ETBF also 
potentiates colorectal carcinogenesis in 
mice, and recently, ETBF was detected in 
biofilms coating human CRCs and precan- 
cerous colonic lesions called adenomas (6). 
Similarly, Escherichia coli expressing the 
genomic island polyketide synthase (pks*) 
enhance tumorigenesis in preclinical CRC 
models and are enriched in human CRC tis- 
sues (7). pks* E. coli produce the genotoxin 
colibactin, which alkylates DNA, resulting 
in DNA adducts (a form of potentially mu- 
tagenic DNA damage) in colonic epithelial 
cells. The recent biochemical resolution of 
this process illustrates how host-microbe 
interactions might lead to cancer (8). 

There is still much to learn from this 
triad of species, including whether they 
interact sequentially, in tandem, or at all 
to influence colorectal carcinogenesis. Al- 
though continuing to study and translate 
knowledge of F nucleatum, ETBF, and 
pks* E. coli for human health is important, 
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these are not the only microorganisms that 
are relevant for CRC. Therefore, a valuable 
resource needed for the advancement of 
CRC-microbiota studies is an atlas to map 
not only what microorganisms are pres- 
ent in the tumor microenvironment but 
also where they are within and on tumors, 
how they interact with one another and 
the host, and how their arrivals and depar- 
tures over time shape the evolving tumor. 
The questions that such a cancer-micro- 
biome atlas can address are wide-ranging. 
For example, what are the consequences, 
if any, of the presence of a given micro- 
biota for how cancer cells function, the 
metabolic wiring of tumors, the oncogenic 
programs that drive growth and metas- 
tasis, and whether and how a cancer will 
respond to therapy? A nascent area of spe- 
cial interest is understanding how the gut 
microbiota in a patient may influence why 


thousands of individuals who have been, and 
continue to be, closely studied over several 
decades prior to, and at presentation with, 
colorectal adenomas and CRCs. Such longi- 
tudinal studies are ideal for understanding 
how the microbiota may be associated with 
CRC risk or influenced by other modifi- 
able or protective factors. Broad screening 
registries for CRC such as the United King- 
dom’s National Health Service Bowel Cancer 
Screening Programme, in which millions of 
fecal occult blood test (FOBT) kits have been 
collected from individuals aged 60 to 74 bi- 
ennially, also provide rich opportunities to 
study the stool microbiome and its correla- 
tion with CRC over time. 

Evaluating the associations that emerge 
from population-based studies in model sys- 
tems remains an essential step in moving 
microbiota discovery toward CRC diagnos- 
tics and therapeutics. Preclinical models in- 


peutic targets as well as allow for patient 
stratification and prognostication. 
Candidate microbes and metabolites 
gleaned from longitudinal prospective 
cohort and cross-sectional studies of 
colorectal adenomas and CRCs can also 
be evaluated in gnotobiotic animal models 
with tractable genetics, such as fruit flies, 
zebrafish, and mice. Gnotobiotic models 
are entirely devoid of endogenous mi- 
crobes or harbor small, well-defined micro- 
bial communities. They are a useful system 
for unraveling the multiplicity of influ- 
ences that microbes and their metabolites 
have in cancer development and cancer 
therapeutic efficacy and toxicity. Recently, 
testing healthy human stool samples and 
cocktails of human-derived microbial iso- 
lates in gnotobiotic mouse CRC models led 
to the observation that groups of bacteria 
can shape interferon-y-expressing (IFNy*) 
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F. nucleatum expresses adhesins and 
lipopolysaccharide (LPS), which can have 
multiple influences on cell behavior. 


ETBF coats tumors and recruits other bacteria 
to create a biofilm. ETBF also recruits immune 
cells and promotes inflammation. 


Polyketide synthase-expressing (pks*) E. coli 
may influence carcinogenesis through the 
generation of potentially mutagenic DNA adducts. 


Gal-GalNAc, galactose N-acetyl-D-galactosamine; miRNA, microRNA; TIGIT, T cell immunoreceptor with Ig and ITIM domains; TLR4, Toll-like receptor 4. 


immunotherapies work or fail in patients. 
Given that geography, diet, environmen- 
tal exposures, and lifestyle choices all in- 
fluence the composition of an individual’s 
microbiota, it is important that such meta- 
data are collected and studied in global CRC 
cohorts. This is crucial to begin to make 
sense of a range of findings including con- 
flicting CRC stool and tumor microbiome 
meta-analyses (5) and variation in chemo- 
therapy toxicities (side effects) observed in 
CRC patients from different continents. 
Patient populations provide a window of 
opportunity to study cancer and the micro- 
biota. However, prospective, longitudinal 
cohorts of healthy populations are also of 
critical importance to understand factors 
that not only increase susceptibility for CRC 
but also decrease CRC risk. The Nurses’ 
Health Study and Health Professional Fol- 
low-up Study have enrolled hundreds of 
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cluding organoids (tissue cultures that form 
three-dimensional organlike structures) and 
animal models are resolving whether and 
how microorganisms are disease drivers 
and key influencers. Organoids can now be 
generated from patient tumor tissue, can be 
genetically manipulated to mirror the acqui- 
sition of critical genetic mutations observed 
in patient tumors, and can be transplanted 
back into animal models to study tumor 
growth and spread in vivo (9, 10). When 
cocultured with microbes or microbial me- 
tabolites, organoid systems, especially when 
nonepithelial host cell components such as 
stromal cells and immune cells are included 
(1D, provide a powerful, reductionist system 
to directly interrogate the roles of microbes 
and their products. Studies from model sys- 
tems facilitate the identification of specific 
microbial metabolites and host-microbial 
signaling pathways that may provide thera- 
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CD8* T cell populations to promote antitu- 
mor immunity (12). Similarly, gnotobiotic 
mouse tumor models are being used to 
evaluate whether a cause of nonrespon- 
siveness to a cancer treatment, such as im- 
munotherapy, results from microbiota (13, 
14). From these and other studies, single 
microorganisms, consortia of microbes, 
and microbial metabolites have been iden- 
tified that are being developed to improve 
cancer treatment efficacy. 

Interventions targeting the microbiota 
and cancer frequently mirror the spectrum 
of preventive and therapeutic strategies 
for infectious diseases, including vaccines, 
antibiotics (as well as antivirulence drugs), 
fecal microbiota transplants (FMTs), and 
phage-based therapeutics. A recent proof- 
of-principle study demonstrated that the 
antibiotic metronidazole could slow the 
growth of F. nucleatum-positive human 
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tumor samples in patient-derived xeno- 
graft mouse models (3). However, it is un- 
clear whether antibiotics could be used in 
CRC prevention, given the years to decades 
over which cancerous lesions in the colon 
develop and the consequent potential of 
developing antibiotic resistance. Highly 
selective antibiotics or antivirulence ap- 
proaches might represent a more viable 
strategy that is less disruptive for human 
microbial ecology. FMT, the transfer of 
fecal material from healthy individuals 
to patients, is increasingly being used to 
combat colitis caused by antibiotic-resis- 
tant Clostridium difficile infection. More 
recently, FMT has been considered for 
the treatment of a number of diseases, in- 
cluding obesity and inflammatory bowel 
disease, with several clinical trials under 
way. FMT has also been piloted in a small 
number of cancer patients who developed 
severe colitis associated with the use of 
immunotherapy (15). 

Beyond FMT, carefully curated cocktails 
of microbes are being tested for C. difft- 
cile colitis and for CRC. The goal of such 
a therapeutic is to use microbial consortia 
to “push out” or exclude a disease-associ- 
ated microorganism from a patient’s gut 
or tumor. Vaccines also hold tremendous 
potential for cancer prevention, with can- 
cer-associated microbes and tumor neoan- 
tigens used to elicit antitumor immunity. 
For many CRC-potentiating microbes, their 
toxins, adhesins, and outer membrane pro- 
teins are attractive vaccine targets. Addi- 
tionally, exciting opportunities also exist 
for using microbiota profiling information 
not only in CRC prevention, diagnostics, 
and therapeutics but also for treatments 
targeting GI microorganisms to decrease 
the toxicities of CRC therapeutics. 
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Inuit harvester Julia Ogina, of the Kitikmeot Inuit Association (left), shares her knowledge on muskox health 


with wildlife veterinarian Matilde Tomaselli (right), in Cambridge Bay, Nunavut, Canada. 


BRIDGE: INDIGENOUS AND SCIENTIFIC KNOWLEDGE 


“Two-eyed seeing” supports 


wildlife health 


Bridging Indigenous and scientific knowledge improves 
wildlife surveillance and fosters reconciliation 


By Susan Kutz! and Matilde Tomaselli!” 


he cry “Don’t shoot the leaders!” is 

central to the traditional knowl- 

edge of Indigenous peoples across 

the Canadian North. For countless 

generations, northern Indigenous 

peoples have witnessed the annual 
caribou migrations, understanding their 
mechanisms and patterns. They know that 
if the caribou leading the migration are re- 
moved, the rest of the herd do not know 
where to migrate and will not return to the 
traditional harvesting grounds. A recent 
scientific study on the migratory behavior 
of hoofed animals also concludes that they 
learn from their conspecifics where and 
when to migrate (J). Experiential-based 
knowledge such as that of the northern In- 
digenous peoples, acquired through prac- 
tice and over generations, has been central 
to human adaptation and survival for mil- 
lennia. Combining this knowledge with 
scientific knowledge will help to achieve 
better-informed and more timely and ef- 
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fective decision-making on wildlife health 
and conservation. 

The urgency of ethically documenting 
and effectively using local knowledge for 
wildlife health surveillance is underscored 
by the current rate of climate change in 
the Arctic and elsewhere (2), where dis- 
ease emergence and wildlife population 
declines are threatening species survival (3) 
and the livelihoods, food safety, and food 
security for Indigenous peoples (4). Many 
of the principles that apply to local knowl- 
edge held by Indigenous peoples also apply 
to local knowledge held by nonindigenous 
people around the world. 


VALUING INDIGENOUS KNOWLEDGE 

Accounts related by Indigenous knowledge 
holders, and even local idioms, provide 
tremendous insights into wildlife biology, 
health, and disease ecology. People living 
in close intimacy with their environment 
have a comprehensive understanding of 
the dynamic nature of the systems in which 
they live, including indicators of ecosys- 
tem health and their natural variability. 
Systematically documenting Indigenous 
knowledge can result in early detection of 
ecological changes (5). For example, obser- 
vations by subsistence hunters of thinner 
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animals can predict an impending popula- 
tion decline long before detection by con- 
ventional population surveys. 

Whether it be overt events such as dis- 
ease outbreaks or more insidious population 
changes, early detection is essential to re- 
spond effectively to conserve wildlife and to 
protect human and domestic animal health, 
food security, and livelihoods (6). Conserva- 
tion organizations are increasingly recogniz- 
ing that respecting Indigenous knowledge is 
essential for equitable development and en- 
vironmental sustainability (7). 

Despite these advances, application of In- 
digenous knowledge for wildlife health as- 
sessment and conservation is in its infancy, 
and there remains uncertainty, and at times 
skepticism, among decision-makers on how 
to combine local perspectives with scientific 
insights for evidence-based decision-mak- 
ing around wildlife. Fundamental barriers 
include the “need to meet scientific cred- 
ibility” to produce reliable accounts and the 
difficulty to “translate user perceptions into 
categories/criteria that rely on numbers” 
(8). Addressing these barriers does not 
mean that Indigenous knowledge must be 
coopted into the scientific knowledge sys- 
tem or to be validated with science; rather, 
it requires engagement in meaningful dia- 
logues that enable different knowledges to 
interface. 


THE POWER OF “TWO-EYED SEEING” 
Recognizing the legitimacy of multiple 
ways of knowing is the first step in bridg- 
ing knowledge types. This includes mutual 
respect for different philosophical theories 
and assumptions underlying knowledge 
systems, acknowledging limitations, and 
capitalizing on strengths (9, 0). 

Scientific knowledge is widely regarded 
as the gold standard because of its focus on 
measurable and objective criteria. However, 
scientific surveillance of wildlife health is 
often limited in space and time, only docu- 
ments certain types of information, and is 
difficult to interpret when data are frag- 
mented. These limitations are especially 
evident in remote settings where fieldwork 
is expensive, logistically difficult, and often 
restricted to specific months of the year. 

Indigenous knowledge, a holistic body of 
knowledge acquired through experience at 
broad spatial and temporal scales, cumula- 
tively captures information across multiple 
interdependent elements (10). Indigenous 
knowledge holders generally cannot talk 
about caribou populations, behavior, or 
health without discussing the environment, 
wolves, insects, and caribou-human inter- 
actions. In many ways, Indigenous knowl- 
edge provides an ideal example of the “One 
Health” approach, in which animals and the 
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complexities of their environment, including 
the human element, are considered simulta- 
neously. Unfortunately, the legacy of colonial 
practices, together with rapid socioecological 
changes and an aging population of knowl- 
edge holders, has led to the breakdown of in- 
tergenerational transmission of Indigenous 
knowledge, putting the continuity of this 
rich knowledge system at risk. 

Bridging multiple knowledge systems 
requires drawing on natural and social 
sciences’ methodologies and constant con- 
sideration for the value systems of all knowl- 
edge holders, a process that is based on 
ongoing iteration and feedback (see fig. S1). 
The Mi’kmaq principle of “Etuaptmumk” 
or “two-eyed seeing” captures the concept 


from resource users and conventional veter- 
inary methods. This approach was essential 
for the global eradication of the infectious 
viral disease rinderpest: Pastoralist knowl- 
edge identified the last foci of rinderpest in 
remote areas of East Africa where scientific 
methods had failed, guiding targeted con- 
trol (22). 

In Arctic Canada, participatory epide- 
miology was used to understand health 
in a declining muskox population (5). 
Knowledge of Inuit harvesters identified 
and characterized changes in population 
demographics, condition, emerging dis- 
ease syndromes, and mortality. Together, 
scientific and Inuit knowledge generated 
a greater understanding of mechanisms 


Indigenous knowledge of bowhead whale behavior provides new insights that are critical for management. 


of bringing different knowledge systems 
together to increase our collective breadth 
and depth of understanding: “learning to 
see from one eye with the strengths of In- 
digenous knowledges...and from the other 
eye with the strengths of Western knowl- 
edges...and learning to use both these eyes 
together, for the benefit of all” (77). 
Participatory epidemiology provides a 
pragmatic framework for applying this 
principle to understand wildlife health. 
Participatory epidemiology was developed, 
and is still largely applied, in low-income 
countries to enhance livestock disease sur- 
veillance. The two-eyed seeing principle 
is achieved through triangulation, or cor- 
roboration of information by using multiple 
methods and sources, including knowledge 
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that drive muskox populations. The utility 
of participatory epidemiology applied to 
wildlife health extends beyond the Arctic 
and Indigenous peoples—particularly to de- 
veloping countries, where data on wildlife 
are often scarce, and strong interactions be- 
tween people, wildlife, and livestock create 
hotspots for emerging zoonoses. 


OVERCOMING BARRIERS 

Bridging Indigenous and scientific knowl- 
edge holds considerable promise in wildlife 
health ecology and surveillance; however, 
challenges remain. As the combined use 
of Indigenous and scientific knowledge 
becomes increasingly encouraged and 
mandated, there is risk of implementing 
approaches that produce power imbalances 
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or include Indigenous knowledge only su- 
perficially. To ethically bridge knowledge 
systems, meaningful engagement with 
knowledge holders is necessary at all stages, 
from project design to data interpretation 
and project evaluation (10, 13). 

To move from anecdote to the data that 
will be used by decision-makers, docu- 
mentation of local knowledge must follow 
rigorous qualitative research design, includ- 
ing repeatable and transparent methods 
(‘need to meet scientific credibility”) (5, 9). 
Identification of expert knowledge hold- 
ers, thematic saturation and triangulation 
(qualitative research methods used to avoid 
missing and to corroborate information, 
respectively), and validation and cointer- 
pretation of results will ensure accurate ac- 
counts, interpreted and applied in context, 
and will serve as an internal peer review 
process (5). 

Through participatory appraisal exer- 
cises (such as mapping, timelines, and pro- 
portional piling), qualitative information 
from resource users can be translated into 
spatial, temporal, and numerical represen- 
tations (“into categories/criteria that rely on 
numbers”) (5). These types of data enable 
inclusion of local knowledge into familiar 
formats used for management decisions. 
As with scientific data, quantifying local 
observations allows us to understand vari- 
ability around observations and act with 
knowledge of that uncertainty. However, 
qualitative and quantitative data must be 
documented and interpreted together to 
avoid misinterpreting information and loss 
of deeper insights. 

Growing efforts to incorporate local 
knowledge into quantitative models [such 
as Bayesian Belief Networks (J4)] can pro- 
vide tools that further facilitate inclusion of 
this knowledge into decision-making. How- 
ever, sophisticated mathematical models 
can pose technical barriers to participation, 
producing power imbalances (75). When ap- 
plying these tools, it is paramount to ensure 
that Indigenous knowledge holders actively 
participate at all stages and share control of 
the process. 

Inevitably, at times, Indigenous and sci- 
entific knowledges will conflict. Partners 
then need to coevaluate the strengths and 
limitations of the respective studies and 
jointly reflect on disparate findings. Such 
a contention arose in 1977 between scien- 
tists and indigenous whalers around the 
number of bowhead whales, which resulted 
in a hunting ban (0). This conflict was re- 
solved only when the knowledge of Ifupiat 
whalers on bowhead whale behavior was 
considered, leading scientists to realize that 
they had severely underestimated the stock 
size based on faulty assumptions. This ex- 
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ample is a reminder that such instances 
of conflict are opportunities for the great- 
est insights into ecological processes to be 
revealed, transformative knowledge gener- 
ated, and improved dialogue among part- 
ners achieved, enabling the development of 
shared solutions. 

“Don’t shoot the leaders” echoes as a call 
for action to enable today’s elders to pass 
their knowledge to the youth who will be 
tomorrow’s leaders. The creation of an in- 
clusive platform for knowledge exchange 
around wildlife health, a theme so central 
to many Indigenous cultures, improves 
wildlife conservation and public health 
protection, promotes intergenerational 
learning and retention of Indigenous 
knowledge within contemporary contexts, 
and contributes to reconciliation with In- 
digenous peoples. Respecting Indigenous 
knowledge and working collaboratively 
with knowledge holders will build capac- 
ity and empower Indigenous communities 
to set their own agendas as advocated by 
the United Nations (7). The latest report by 
the Intergovernmental Panel on Climate 
Change (IPCC) highlights the rate at which 
climate change is affecting the Arctic and 
other regions of the world (2), emphasiz- 
ing the urgency with which we all need to 
embrace the concept of Etuaptmumk, to 
cogenerate knowledge and to learn from 
each other “for the benefit of all.” 
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Squeezing 
out higher 
precision 


Well-timed kicks to an ion’s 
momentum enable better 
position measurements 


By Monika Schleier-Smith 


aking measurements at a preci- 

sion limited only by quantum 

uncertainty is crucial for applica- 

tions such as the detection of grav- 

itational waves (1) or the search 

for dark matter (2). In this quest, 
physicists can reduce uncertainty in mea- 
surements of one variable at the expense 
of another (3, 4). Ultimately, however, the 
precision is often limited by the classical 
noise of the measurement apparatus. In 
principle, a solution to this problem is to 
amplify the signal relative to the classical 
noise. Readers familiar with cassette tapes 
may recognize this strategy, which is much 
like turning up the volume of music in a 
car so that it can be heard above the engine 
noise, only to have the tape hiss obscure 
the finer features of the music. On page 
1163 of this issue, Burd et al. (5) report 
extremely sensitive measurements of the 
position of an individual ion, achieved by 
both suppressing quantum noise and am- 
plifying signal. 

Trapped ions are exquisitely well-con- 
trolled quantum systems, used as ultrapre- 
cise clocks (6) and as building blocks for 
quantum computers (7, 8). Electric fields 
can be used to suspend a single ion in ul- 
trahigh vacuum, and lasers are then used 
to bring it to rest in its lowest energy state. 
Even in this stable resting state, the ion has 
zero-point energy associated with quantum 
fluctuations in its position and momentum. 
In the case of the low-mass magnesium 
atom studied by Burd et al., the motion cre- 
ated by the zero-point energy is 70 times the 
size of the ion in its trapped ground state. 
Does this mean that precise measurements 
of its position are futile? 

The limit imposed by the Heisenberg 
Uncertainty Principle makes knowing both 
an object’s position and its momentum at 
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the same time impossible. However, we 
can know the position alone to arbitrary 
precision. When the imprecision becomes 
smaller than the quantum uncertainty 
of the resting state, the result is called a 
squeezed state. Squeezing the uncertainty 
in position requires hiding all information 
about the object’s momentum, which in 
turn requires pumping in energy. The en- 
ergy must, however, be added carefully to 
avoid just heating the ion up and increas- 
ing the uncertainties in both momentum 
and position. 


extend and contract your legs, you can am- 
plify either your initial momentum or your 
initial displacement, but not both, as the 
other will be attenuated. Hence, Burd et al’s 
amplification of the initial quantum fluctua- 
tions in momentum at the same time attenu- 
ates the quantum fluctuations in position, 
which places the ion into a squeezed state 
(see the figure). At a later time during the 
experiment, the same pumping trick ampli- 
fies a perturbation of the ion’s motion, which 
is the signal that the experimenters wish to 
precisely measure. In amplifying the signal, 


Improving sensitivity to small displacements 
Carefully manipulating the quantum fluctuations in an ion's motion improves the measurement precision. 


Energy 
lon 7 
lon trap Probability 
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Squeeze 
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Ground state 
quantum 
uncertainty 


The trapped ion is in its 
lowest energy state with an 
uncertainty in position and 
momentum (blue circle) 
set by quantum mechanics. 


The ideal way to pump in energy, it turns 
out, is known to every child who has sat 
on a swing and pumped her legs. Imagine 
that the child starts out sitting as still as she 
can, like an ion at rest in its trap. Her posi- 
tion nonetheless might be ever so slightly 
perturbed from equilibrium, depending on 
the direction of the latest gust of wind. If 
she now starts to pump her legs, she can 
amplify that slight impulse. A distant ob- 
server who was unable to discern the size 
or direction of the initial perturbation will 
nonetheless easily see the amplified motion 
with each pump of the child’s legs. Burd et 
al. achieve the same effect by modulating 
the electric fields that trap the ion, relaxing 
and tightening its confinement in the same 
rhythm as the child extending and com- 
pressing her legs. 

You may recall from your own childhood 
that the timing of this rhythm is crucial, to 
avoid inadvertently slowing yourself down. 
Specifically, depending on exactly when you 
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Squeezed 
state 


The energy of the ion is increased 
by modulating the electric field, 
which increases uncertainty in 
momentum but decreases it in 
position below the quantum limit. 


Displace Unsqueeze 


The ion is displaced 
while in the 
squeezed state. 


they also amplify the quantum noise back 
to its original unsqueezed level. As a result, 
even with a measurement apparatus that can 
barely resolve the quantum noise of the ini- 
tial state, the complete protocol can detect 
much smaller displacements, down to the 
level of the squeezed uncertainty. 

This principle of amplified quantum sens- 
ing has been used to enhance optical inter- 
ferometers (9) and has been demonstrated 
in atomic clocks and sensors whose signals 
are stored in internal states of an ensemble 
of many atoms (J0-12). Its application to de- 
tecting tiny displacements of a single parti- 
cle, however, required surmounting different 
technical challenges. The authors showed 
amplification of the quantum motion with- 
out adding any appreciable extra noise. In 
other words, they pumped energy into the 
system without heating it. They detected 
a displacement of less than 1 nm, which is 
smaller than the initial quantum uncertainty 
in the ion’s position by a factor of 7. 
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ton 
even more precise. 
Extended to groups of 
= / multiple ions, noiseless 
amplification can enhance 
interactions that allow 


Amplifying the displacement 
returns the original 
quantum uncertainty, and 
both are now larger 

than the measurement error. 


The study by Burd et al. is an important 
milestone in advancing toward ultimate 
quantum limits in precision measurement 
and opens the door to several applica- 
tions. One prospect is to make ions even 
better at what they already do best, which 
is precision spectroscopy. For example, by 
comparing the absorption spectra of differ- 
ent ionic species, it is possible to address 
fundamental questions such as whether 
the constants of nature are really constant. 
One method of measuring these spectra 
is notable for being applicable to a wide 
range of different ions and 
relies on detecting the tiny 
momentum kick imparted 
when an ion absorbs a pho- 
ton (73). Quantum squeez- 
ing and amplification can 
make this method of pho- 

recoil spectroscopy 


for transmitting quantum 
information between the 
ions (14) and may thus ulti- 
mately find applications in 
quantum computing. An- 
other exciting prospect is to 
precisely detect the motion 
of increasingly massive ob- 
jects that can be cooled to 
the quantum regime (/5). A 
grand challenge is to detect 
minute gravitational forces 
in a regime where quantum 
mechanics simultaneously 
plays a role. Every bit of 
extra precision that can be squeezed from 
such systems will be essential to achieving 
that goal. 


Measured 
displacement 
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RESEARCH FUNDING 


Government-funded research 
increasingly fuels innovation 


Nearly a third of U.S. patents rely directly on federal research 


By L. Fleming’, H. Greene’, G. Li’, 
M. Marx, D. Yao* 


nnovation increasingly relies on sci- 

entific knowledge (J, 2). Research to 

generate that knowledge has histori- 

cally been funded by both industry and 

government. Although industry and 

government research spending was rel- 
atively equal in 1980 in the United States, 
by 2010 their shares had shifted to 60% and 
30%, respectively (3). Yet, despite this in- 
crease in industrial spending, firms appear 
to be pursuing—or at least publishing—less 
basic science (4). If corporations are doing 
less basic research, then where do they find 
the ideas to fuel their innovation? Here, 
we detail individual bibliometric linkages 
across tens of millions of documents and 
quantify the broad sweep and impact of 
U.S. federally supported research on pat- 
ented innovation over most of the past 
century. We illustrate how patentees, both 
US. and non-U.S., and corporations in par- 
ticular, increasingly depend upon federally 
supported research as a source of scientific 
knowledge. Although multiple mechanisms 
interact and contribute to the trend, fed- 
eral research increasingly appears to fuel 
the innovation that ultimately leads to jobs, 
industrial competitiveness, and entrepre- 
neurial success. 

Though research has established the 
rise of “open innovation” communities, 
research consortia, and markets for ideas 
as sources of innovative ideas (5), the role 
of government funding for research has 
not been ignored. Recent research—at the 
micro level of individual science papers 
and patents—has quantified how 10% of 
grants awarded by the U.S. National In- 
stitutes of Health (NIH) generate pat- 
ents (2). Macro-level and nonquantitative 
histories have also described the broad 
sweep of government impact (6). Here, we 
consider every U.S. patent assigned to U.S. 
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inventors since 1926, and show that the 
proportion of patents relying on federally 
funded science has outstripped the overall 
increase in patenting, plateauing since a 
high of 30.0% in 2011 (see the first figure). 
Despite this plateau, the absolute number 
of patents that rely on federally supported 
research has almost doubled recently, from 
22,647 in 2008 to 45,220 in 2017. We count 
reliance when a patent is owned by the 
government, acknowledges government 
support, or directly cites a patent or sci- 
ence paper that acknowledges support. 
Corporations, identified as the owner of a 
patent from its “assignee” field, account for 
most of the increased reliance on govern- 
ment-supported research since the 1970s 
(see the second figure). Corporations have 
increased their own acknowledgment of 
support and their citation of government- 
supported science papers and patents [al- 
though alternatives cannot be entirely ruled 
out, see supplementary materials (SM) for 
evidence that the trends are robust to more 
versus less stringent definitions of reliance 


and also do not result from a change in re- 
porting requirements or in response to re- 
quirements, an increase in the citable stock 
of federal research, or a mechanical process 
due to an increase in the number of cita- 
tions per patent]. 

The sharp increase following World War 
II (WWII) illustrates the invention behind 
the vision articulated by Vannevar Bush in 
1945, of how federal investment in science 
research could train the technical work- 
force and support innovation, resulting in 
“..new products, new industries, and more 
jobs...new and improved weapons” (7). 
The almost monotonic increase in reliance 
has been partially enabled by the steady 
though slower rise in inflation-adjusted 
federal research dollars over the time pe- 
riod, even as federal science investment as 
a percentage of gross domestic product has 
decreased (8). 

Prior to the 1970s, much of the govern- 
ment-supported patenting in the United 
States occurred inside the government. 
Universities, often supported by govern- 
ment research grants, began patenting 
more in the 1980s, motivated in part by 
the Bayh-Dole Act, which sought to spur 
commercialization through assignment of 
property rights to universities. Lone in- 
ventors constituted a large proportion of 
reliance upon government research in the 
early part of the last century, and their reli- 
ance has remained steady. 

Although some portion of the overall 
increase shown in the first figure results 
from a change in the composition of pat- 
ented technologies, growing investment in 


Patentees increasingly depend upon federally supported research 
Total granted U.S. patents by U.S. inventors (blue bars), and subtotal that rely on federal research (orange 
bars), and proportion of patents (black line = orange bars/blue bars) that rely on federally supported research. 
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health research, and an accompanying rise 
in patents supported by the Department of 
Health and Human Services (HHS) (which 
includes the NIH), all technologies have 
relied increasingly upon government sup- 
port, albeit to varying degrees. Federal sup- 
port for U.S. invention gets routed through 
a variety of agencies. In 2017, for example, 
the Department of Defense (including the 
armed services) supported 6.2% of the total 
of U.S. inventions, HHS 5.4%, Department 
of Energy 3.9%, National Science Founda- 
tion 2.9%, NASA 1.0%, and Department of 
Agriculture 0.50% (others 5.5% and unspec- 
ified 2.5%; see SM for illustrations of R&D 
investment and patenting by agency and by 
technology class). 

Large, small, and micro firms all draw 
upon government research, as observed 
in patent-renewal data. Startups also de- 
pend heavily on government research, as 
34.6% of the 121,765 patents assigned to 
venture-backed companies from 1976 to 
2016 cited federally supported research; 
by comparison, for all corporate patents 
during the same time period, only 21.7% 
rely on federally supported research. The 
higher reliance of startups on government- 
supported research may be explained by 
resource constraints, which inhibit inter- 
nal R&D expenditure in young ventures 
and encourage identification and external 
licensing of promising technologies. It also 
illustrates the intended consequence of the 
Bayh-Dole Act and an important path of 
commercialization for federal science and 
technology research. 

Foreign inventors’ reliance on federal re- 
search in their U.S. patenting has steadily 
increased since WWII (see the third fig- 
ure), though the proportion of total foreign 
patenting in the U.S. system that relies on 
federal research still lags behind that of 
U.S. inventors (in 2017, 28.2% of U.S. pat- 
ents by U.S. inventors, and 12.4% of the 
U.S. patents by foreign inventors, relied 
on federally supported research). China 
has recently begun to rely more heavily on 
US. federal research; however, its reliance 
is still small relative to most (in order of 
reliance in 2017): Japan, Germany, Korea, 
England, France, China, Taiwan, and India. 
The greatest recent increase has come 
from the category of “other” nations, per- 
haps reflecting easier access via the web 
and other electronic libraries to the patent 
and scientific literature. 

Many factors make it more difficult for 
foreign inventors to gain access to U.S. sci- 
ence—including language differences, the 
cost of international travel, fewer social ties 
between scientists and engineers, problems 
of security clearances and access to sensi- 
tive technologies, and surely less mobility 
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from (U.S.) university positions into private 
firms in foreign countries. Moreover, coun- 
tries vary in their degree of local specializa- 
tion and may find U.S. research less helpful, 
if working in areas not supported by U.S. 
research. Although other nations’ inven- 
tors have increased their reliance on fed- 
eral research, the rate of increase appears 
less than that of U.S. inventors, even as the 
world’s scientific and technical knowledge 
has become digitized and thus more easily 
searched and used. 

Corporate patents that rely on federal 
research are consistently more impor- 
tant than those that do not, as measured 
by the number of prior art citations from 
subsequent patents (such citations estab- 
lish the departure point of a new patent). 
Comparing population averages for all U.S. 


tation increase arises from both the firm’s 
subsequent patents (1.53 additional self- 
citations on average) and citations in other 
firms’ patents as well (1.86 additional ci- 
tations from other organizations on aver- 
age), indicating that both the inventing 
firm and its competitors find these techno- 
logical trajectories more fertile. 

The same pattern holds for corporate 
patents that cite federally supported sci- 
ence publications relative to those that 
cite science publications that are not gov- 
ernment supported, with a matched and 
paired mean difference of 3.57 citations, 
and a mean difference in self- and nonself- 
citations of 1.17 and 2.41, respectively. Prior 
research has estimated that an additional 
citation is associated with an increase in 
the value of the patent of up to 3% (9). 


Reliance upon federal support has been dominated by corporations 
Stacked area chart of percentage of total U.S. patents by U.S. inventors that are owned by government or rely 
on federally supported research papers or patents, by type of patent owner. 
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corporate patents granted in 2010, patents 
that rely on federal research receive 6.33 
citations on average in the 5 years follow- 
ing the grant versus 4.42 for those that do 
not. Additional panels in 2000, 1990, and 
1980 show similar but not always signifi- 
cant effects, possibly because of thinner 
data or changes in corporate patenting 
strategy (see SM). These population aver- 
ages aggregate a great variety of different 
technologies, industry-specific patenting 
strategies, and number of inventors on the 
patent, however, which motivates a com- 
parison of similar types of patents. Match- 
ing and pairing corporate patents that cite 
federally supported patents to similar cor- 
porate patents that did not (see SM), the 
former receive on average 3.39 more cita- 
tions in the 5 years after issuance. The ci- 
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Citation increases also benefit the larger 
economy as citations that occur from out- 
side the inventing firm can be interpreted 
as knowledge spillovers and positive ex- 
ternalities from the inventing firm to the 
larger economy (10). 

Consistent with the citation measure and 
again comparing matched patents, patents 
that rely on federal research are slightly 
more likely (<2%) to pay renewal fees (re- 
quired payments that keep the patent in 
force) after 4 years. They are also more likely, 
on average, to introduce words that are new 
to the patent corpus, indicating foundational 
patents with greater novelty (1). 

Although the bulk of reliance historically 
can be traced through citation of prior pat- 
ent art, U.S. inventors have recently shifted 
their citation patterns to favor more im- 


sciencemag.org SCIENCE 


GRAPHIC: N. CARY/SCIENCE 


GRAPHIC: N. CARY/SCIENCE 


mediate citation of government-supported 
science papers. The average percentage of 
prior art citations within each patent that 
are supported by federal research fell from 
a high of 15.8% in 2003 to 9.9% in 2017. By 
contrast, citation of scientific papers that 
are supported by federal research reached 
a historical high of 10.7% in 2017, up from 
6.9% in 2004 (see SM). 

If federal science research is so impor- 
tant to corporations, why don’t they under- 
take all (or at least more of) their research 
internally? Putting aside the question of the 
optimal rate and types of corporate taxation, 
why should the government subsidize private 
firms through federally funded and managed 
research investment? Much research has ar- 
gued and demonstrated that corporations 
probably underinvest in research (12), ow- 


Our bibliometrics enable explicit link- 
age of government support and output but 
remain vulnerable to many limitations in 
methodology, interpretation, and causal at- 
tribution. Patents are only one—and an ad- 
mittedly imperfect—measure of innovation, 
particularly prior to the availability of com- 
prehensive data in the mid-1970s. Moreover, 
the use of patents varies across industries 
and over time (though the matched analysis 
above controls for technology, number of 
inventors, and time). Our focus does not ex- 
tend, for example, to innovations protected 
as trade secrets or through copyright. 
Other factors that complicate causal inter- 
pretation include the concurrent (though 
slower) rise in federal spending on science 
over the period, changes in compliance 
requirements (13), changes in corporate 


Foreign reliance on federal research has steadily increased 

Proportion of U.S. patents by non-U.S. inventors that rely on federally supported research, stacked by country 
(for comparison, the black line above is not stacked, and illustrates the proportion of U.S. patents by U.S. 
inventors that rely on federally supported research, from the first and second figures). 
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ing to uncertainty and the difficulty of com- 
mercializing research by the firm doing the 
research (as opposed to that firm’s competi- 
tors). Furthermore, from a societal viewpoint, 
the discoveries from research are best distrib- 
uted widely throughout an economy, rather 
than being kept in a single firm, thus reduc- 
ing duplicative efforts. Research aimed at 
fundamental understanding thus constitutes 
a public good, typically precedes commer- 
cialization, and can be applied by many cor- 
porations, across many industries, for many 
years following publication. Although these 
results might be interpreted as evidence of 
short-sighted lack of research investment on 
the part of corporations, they can also be in- 
terpreted as evidence of the effectiveness of 
public investment in science and technology 
research in spurring innovation. 
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patenting and citation strategies, and im- 
provements in database and search tech- 
nology that enabled increasingly accurate 
acknowledgment of government support 
(see SM for discussion). 

Furthermore, this study does not com- 
pare observed output to the counterfactual: 
patenting and publishing that would have 
(not) occurred in the absence of federally 
supported research. Hence, one cannot esti- 
mate the extent to which federal support of 
R&D has crowded out private investment, 
though there is increasing evidence that 
thoughtful applications of such support can 
encourage additional and complementary 
private investment (2). 

These issues notwithstanding, these 
data provide a quantitative estimate of the 
changing impact of federal research on U.S. 
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patenting. They show that almost one-third 
of U.S. invention—and the more important 
part as measured by future citations, renew- 
als, and novelty—relies on federal research 
investment. Our measures conservatively 
use only direct linkages, ignoring dubious 
or missing acknowledgments of support as 
well as knowledge flow from second- and 
later-generation citations. These analyses 
also ignore the nonpecuniary (and possi- 
bly more substantial) benefits such as bet- 
ter weapons, improved health and medical 
practices, technical workforce training, di- 
rectly and indirectly supported jobs, and 
entrepreneurial ideas that spark startups 
(Google’s Page Rank patent acknowledges 
government support, for example). 

If the sponsoring nation’s firms can more 
effectively take advantage of that nation’s 
research—as appears to be the case to date 
in the United States—then firms can exploit 
a fundamental competitive advantage in 
international competition. Federal research 
increases the likelihood that new industries 
will locate in that country and provide jobs 
and productivity spillovers to older indus- 
tries. Policy-makers should consider these 
newly observable benefits of federal re- 
search when they formulate tax policy and 
science research budgets. 
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Exercising empathy 


Compassion can be cultivated, argues a psychologist 


By Sara Konrath 


oogle searches for “empathy” have 

been increasing over the past 15 

years, and it’s the latest buzzword 

at companies as diverse as Facebook 

and Ford. Joining several recent 

books on the topic [such as (J-3)] is 
The War for Kindness by Stanford psycholo- 
gist Jamil Zaki, which sets out to 
help people increase their empa- 
thy in sustainable ways. 

Zaki opens the book by reveal- 
ing that many people believe that 
empathy is a fixed trait with ge- 
netic underpinnings. He calls this 
the “Roddenberry hypothesis,” 
named after Gene Roddenberry, 
the creator of Star Trek, a show 
that featured characters at the ex- 
tremes of the empathy scale. 

Zaki is deeply opposed to this 
idea. His research finds that 
people can become more empathic with 
practice, and when people believe empathy 
is changeable, they spend more time and ef- 
fort helping others (4). This is important in 
a time of both declining empathy (5) and 
rising political polarization. 


The reviewer is director of the Interdisciplinary Program 
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THE WAR FOR 
KINDNESS 


The War for Kindness 
Building Empathy 
in a Fractured World 


Jamil Zaki 
Crown, 2019. 268 pp. 


Zaki is a compelling writer, and even an 
android could not help but respond to his 
prose. His chapters, which treat readers to 
a broad tour of the most interesting recent 
empathy research, read like a series of long 
essays. This is a clever approach because, 
as he notes, one effective empathy-building 
technique is storytelling. Zaki opens the 
book with the story of his parents’ acrimoni- 
ous divorce to show how powerful 
empathy can be when used as a 
tool to maintain relationships. By 
embracing each of their perspec- 
tives, he writes, he was able to 
remain connected to both parents 
even as they were increasingly 
disconnected from each other. 

The chapter “Caring too much” 
starts with another personal ex- 
ample: the story of the birth of 
Zaki’s daughter Alma, who suf- 
fered a stroke during a difficult 
birth, and the compassionate 
staff who cared for her during the early days 
of her life. He then artfully transitions into 
how neonatal nurses and others in similar 
professions can sometimes be negatively 
affected by others’ suffering. This is a very 
important discussion, especially in light of 
recent debates on the potential negative im- 
plications of empathy (2, 3). 

Yet after sharing the potential harms 
associated with too much empathy, Zaki 
confusingly shifts to the opposite conclu- 
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People can become more empathic with 
time and practice, finds Zaki. 


sion at the end of the same chapter, arguing 
that empathy helps to protect people from 
negative effects of stress. This betrays a 
common problem with the term “empathy,” 
which is its many definitions (6). If empathy 
is defined as personal distress or emotion 
sharing, there is indeed research that shows 
negative implications (7). Yet most empathy 
scholars would define it as more akin to 
compassion. Using this definition, several 
studies find that empathy actually serves as 
a buffer for burnout (8, 9). 

The book includes extensive footnotes, 
a detailed appendix on the definitions of 
empathy, and a second appendix that at- 
tempts to evaluate the evidence discussed 
in the book in an accessible way. Scientists 
will likely appreciate these sections, but 
they may not be satisfied by them, and lay 
readers will probably ignore them. Still, I 
understand why Zaki included them: to ap- 
pease persnickety academics such as me. 
Academic audiences, however, would ulti- 
mately be better served by more scholarly 
tomes [such as (10, 11)]. 

The War for Kindness may inspire readers 
to learn more, but Zaki’s goals go beyond 
sharing the science of empathy with the 
masses. He hopes to inspire people to ac- 
tually practice more kindness in their lives, 
writing in the book’s closing lines, “Building 
a new sort of empathy takes effort and sac- 
rifice, for people who might not repay it.... 
We each have a choice, and the sum of our 
choices will create the future. What are you 
going to do?” 

Although he does not offer clear action 
points, the stories told throughout the book 
reveal many ways to increase empathy. Ulti- 
mately, however, there is no quick fix. Build- 
ing empathy takes time and effort (12). Take 
heart, though; it gets easier with practice. & 
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PALEONTOLOGY 


The dinosaur as social capital 


A new history reveals how the wealthy elite helped shape 
modern natural history museums in America 


By Ilja Nieuwland 


he first 6 months of 1905 saw two 

prominent, celebrity-studded, and 

copiously publicized unveilings of 

full-size dinosaur exhibits. In Febru- 

ary, the American Museum of Natu- 

ral History (AMNH) in New York 
unveiled its Brontosaurus mount; in April, 
Andrew Carnegie’s donation of a cast of 
Diplodocus carnegii (his “namesake,” as 
he liked to call it) was presented to the 
British Museum in London. As Lukas Ri- 
eppel explains in his book, Assembling 
the Dinosaur, neither the timing nor the 
place nor the form of these events rested 
on coincidence. 

In the past decade, the “Long Gilded Age” 
of American natural history museums— 
roughly the last third of the 19th century 
and the first decade of the 20th—has been 
subjected to an unprecedented number of 
treatments (J-3). Rieppel’s book is a wel- 
come addition to this body of literature, 
not in the least because of his approach, 
which seeks to integrate dinosaurs’ status 
“as an object of technical knowledge with 
social, cultural, and economic history.” 

The result is a book about dinosaurs 
that features surprisingly few of them (and 
fewer fossil hunters still). If that sounds 
like criticism, it is not. Rieppel’s focus is 
on the people who used dinosaur mounts 
to convert monetary capital into social 
capital and, in so doing, built the natural 
history museums that define the museum 
landscape today. 

At the turn of the 20th century, a time in 
which big things mattered, dinosaurs were 
a natural point of attraction for wealthy 
plutocrats. And because natural history mu- 
seums could be instrumentalized to trans- 
late them into social capital and ideological 
metaphor, this created an almost natural 
bond between such institutions and wealthy 
benefactors looking for recognition. 

Rieppel frames America’s big natural his- 
tory museums as tools for the dissemina- 
tion of ideas about capitalism and society 
as much as they are tools for disseminat- 
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ing ideas about natural history. He shows 
how various financial and administrative 
mechanisms from capitalist ventures (par- 
ticularly industry) shaped the operations 
of natural history museums and led to a 
strict regulation of both their internal and 
external relations. 

The main focus is on the AMNH, argu- 
ably the United States’s most important 
natural history museum, and its main fig- 
urehead, the paleontologist Henry Fairfield 
Osborn. In a sense, this book is a careful 


The armature for a Diplodocus is welded at the 
Carnegie Museum of Natural History, ca. 1904. 


deconstruction of Osborn’s efforts to in- 
strumentalize natural history for ideologi- 
cal ends—ideologies that usually served 
the agenda of the AMNH’s backers. A good 
example of this would be his advocacy of 
“aristogenesis,’ a concept that holds that 
evolution proceeds in a predetermined 
path that favors the “best” genes—an idea 
obviously attractive to patricians at the top 
of New York’s social order. 

The same events were taking place in 
other new institutions, including Chicago’s 
Field Museum and Pittsburgh’s Carnegie 
Museum, both of which feature promi- 
nently in Rieppel’s narrative. He describes 
how wealthy philanthropists such as Mar- 
shall Field and Andrew Carnegie shaped 
these institutions in much the same form 
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as they had done with their commercial 
ventures: by applying a strict social hierar- 
chy and rigorous bookkeeping throughout 
their respective organizations. 

With the exception of Osborn, Rieppel 
may be somewhat guilty of underestimat- 
ing intermediary figures such as William 
Holland and Frederick Skiff, who actually 
converted their proprietors’ ideas into 
practice. These museum directors were not 
just delegates of their wealthy proprietors; 
they were their social equals and represen- 
tatives, and although they rarely misun- 
derstood the museum’s role in generating 
social capital for their bosses, they also 
had agendas of their own. This could make 
things difficult when the museum’s inter- 
ests failed to coincide with the interests of 
its patrons. Even Holland, for example—no 
stranger to sycophancy—occasionally is- 
sued stern reprimands to Carnegie when 
he felt his boss was losing sight of the mu- 
seum’s interests. 

Unlike the previous generation of collec- 
tors, the pressure was on new museum cu- 
rators to prove themselves as the defenders 
of “pure” scientific interests. But, as Riep- 
pel emphasizes, they eventually came to 
serve the same sociocommercial agenda 
because “pure” science was a highly valued 
ideological commodity whose aura could 
be made to fit more openly commercial 
agendas, too. 

Assembling the Dinosaur is a solid en- 
try into the growing body of literature on 
Gilded Age American paleontology, but it 
is particularly valuable for its contribution 
to enhancing our understanding of how 
science and its representation during that 
period were influenced by, and in turn af- 
fected, society as a whole. By incorporating 
cultural, economic, and scientific devel- 
opments, Rieppel shines new light on the 
history of both American paleontology and 
museum exhibition practice. ! 
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Invest in amphibians 
and reptiles 


The Yangtze giant softshell turtle (Rafetus 
swinhoei) is listed in Appendix I of the 
Convention on International Trade in 
Endangered Species of Wild Fauna and 
Flora (7) and categorized as Critically 
Endangered by the International Union 
for Conservation of Nature (2). Threats to 
Rafetus swinhoei have long been known, 
including habitat loss caused by humans’ 
excessive hunting for food and trade, 
resource competition, climate change, 
disease, and genetic stochasticity (3-5). 
Despite our understanding of the risks, on 
14 April, the only known female Rafetus 
swinhoei died in China, leaving one captive 
male in China and two known wild males 
in Vietnam. Unless another female is found 
in the wild, the species is now functionally 
extinct (6, 7). 

More investment in conservation 
may have been able to save this species. 
However, insufficient funding still hinders 
amphibian and reptile conservation, 
and public awareness of wildlife protec- 
tion remains weak. China must further 
expand its nature reserves and implement 
scientific management and protection 
policies to save other endangered amphib- 
ians and reptiles. In addition, developed 
countries and international organiza- 
tions should help developing countries 
that lack the capital, management, and 
technology required to protect reptiles 
and amphibians. Long-term interna- 
tional collaborations could prevent other 
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Only three known 
Yangtze giant 
softshell turtles 
remain. 


endangered amphibians and reptiles from 
following in the path of Rafetus swinhoet. 
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Brazil’s biodiversity 
researchers need help 


Between 2006 and 2014, Brazil experienced 
a boom in its national science and technol- 
ogy budget (7) that fomented local research 
groups and enhanced international col- 
laboration, resulting in the rise of Brazil in 
scientific rankings (2). During that period, 
Brazil engaged in environmental conserva- 
tion (3), earning recognition as a global 
biodiversity leader (4). However, recently 
elected president Jair Bolsonaro gutted 

the budget for science and higher educa- 
tion, causing funding agencies to run out 
of cash by July 2019 (5) and universities 
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to struggle to pay maintenance bills (6). 
Environmental regulations are collapsing, 
benefiting major landowners and com- 
modity companies that demand easier 
environmental licensing, permits for harm- 
ful pesticides, and elimination of protected 
areas (7). The advances in science and con- 
servation from the past decade will soon be 
reversed, which has caused international 
concern (8). As Brazilian researchers 
struggle to adjust to the national budget 
and biodiversity crises, the global biodi- 
versity community can help by recognizing 
the challenges Brazil’s scientific commu- 
nity now faces. 

Many international scientific societies 
are dedicated to fostering research through 
scientific meetings, research grants, and 
scientific publications. Researchers from 
low-income and lower-middle-income 
countries are often offered grants [e.g., 
(9)], discounts for membership and meet- 
ings [e.g., (J0)], and reduced publication 
fees (11). Brazilian scientists have not been 
eligible for these incentives since 2006 
(12), which made sense before the recent 
backslide in policy. However, in Brazil’s 
new antiscience climate, the eligibility 
requirements exclude an important share 
of qualified researchers now working 
under precarious conditions. We urge the 
international scientific community to help 
ameliorate the crisis by making Brazilian 
researchers eligible for incentives. 
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China's ineffective 
plastic solution to haze 


The Beijing-Tianjin-Hebei (BTH) region of 
China, one of the most densely populated 
regions in the world (1, 2), suffers from 
severe atmospheric haze (2, 3). In an effort 
to minimize fine dust blown by the wind 
[called eolian dust (4)], a precursor to haze 
(2), local governments have adopted a strat- 
egy of covering all bare land and soil in the 
region with plastic gauze [e.g., (5)]. However, 
the plastic has no effect on haze and causes 
soil pollution, increasing ecological risk. 

Statistically, persistent haze in the BTH 
region occurs during low wind speeds (less 
than 2 m/s), allowing an accumulation of 
fine particles [e.g., particulate matter with 
a diameter of less than 2.5 zm (PM, ,)] in 
the stagnant air (3). Although covering 


land with gauze may be useful to prevent 
localized dust generation during high 
winds, the gauze has no direct effect in the 
fight against haze (6—8). However, using 
gauze to cover land introduces a substan- 
tial amount of plastic into the soil, which 
can affect soil properties. Meanwhile, 
plastic decomposition may be a source 
of the microplastics in water and coastal 
sediments of the region (9). The plastic 
cover also hinders plant recolonization and 
harms natural habitats, including the habi- 
tat of migratory birds, which use the coast 
of Bohai Bay as an important staging site 
on the East Asian-Australasian Flyway (J0). 
It is clear that the vast amount of plastic 
deposited in the environment does not help 
ameliorate haze, but it is causing ecological 
problems. Policy-makers must stop treat- 
ing one environmental issue by creating 
another one. Instead of plastic cover, the 
bare soil and land should be rewilded with 
ecological restoration (11). 
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CANCER 


Sweet bystander becomes a villain 


atients with pancreatic cancer often have elevated blood levels of CA19-9, a carbohydrate 
antigen present on many proteins. CA19-9 is thus commonly used as a biomarker for diagnos- 
ing and monitoring disease progression. In a study of mice, Engle et al. found that CA19-9 may 
be more than an innocent bystander that marks the presence of pancreatic disease; it may 
play a causal role in disease (see the Perspective by Halbrook and Crawford). Transgenic mice 
expressing the human enzymes that add CA19-9 to proteins developed severe pancreatitis that 
could be reversed by treatment with CA19-9 antibodies. When the transgenic mice also harbored a 
Kras oncogene, they went on to develop pancreatic cancer. These unexpected observations suggest 
new avenues for the treatment of pancreatic disease. —PAK 


Science, this issue p. 1156; see also p. 1132 


particle mobility 


Girard et al., p. 1174 


An immunofluorescence image of a pancreas in a mouse model of pancreatic disease 


PHYSICS 
Improving precision with 
quantum amplification 


Quantum mechanically, an 
object can be described by a 
pair of noncommuting observ- 
ables, typically by its position 
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and momentum. The precision 
to which these observables 

can be measured is limited by 
unavoidable quantum fluctua- 
tions. However, the method of 


“squeezing” allows the fluctua- 


tions to be manipulated, while 
preserving the Heisenberg 
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uncertainty relation. This allows 
improved measurement preci- 
sion for one observable at the 
expense of increased fluctua- 
tions in the other. Burd et al. 
now show that an additional 
displacement of a trapped 
atom results in amplification 
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of the squeezing and a further 
improvement in the precision 
with which the displacement 
can be determined (see the 
Perspective by Schleier-Smith). 
This technique should be useful 
for anumber of applications in 
metrology. —ISO 

Science, this issue p. 1163; 

see also p. 1137 


RADIOTRACER CHEMISTRY 
A metal-free route to 
PET probes 


Positron emission tomography 
(PET) is a widely used imaging 
technique for medical diag- 
nostics and pharmaceutical 
development. As the name 
implies, it requires tracers that 
emit positrons, typically through 
labeling with fluorine or carbon 
radioisotopes. W. Chen et al. 
devised a versatile technique 
to incorporate radioactive 
fluoride into aromatic rings. 
The metal-free photochemical 
method directly substitutes ary! 
carbon-hydrogen bonds with 
[°F ]fluoride and so is particu- 
larly well suited to late-stage 
transformation of complex 
molecules into tracers. —JSY 
Science, this issue p. 1170 


CORAL REEFS 


Little fish make a big 
contribution 


Coral reefs represent one of 

the most biodiverse and rich 
ecosystems. Such richness con- 
jures up images of coral heads 
and large colorful reef fishes. 
Brandl et al. show, however, 
that one of the most striking 
and important parts of the reef 
ecosystem is almost never seen 
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(see the Perspective by Riginos 
and Leis). Small cryptobenthic 
fish, like blennies, make 
up nearly 40% of reef fish 
biodiversity. Furthermore, the 
majority of cryptobenthic fish 
larvae settle locally, rather than 
being widely dispersed, and 
have rapid turnover rates. Such 
high diversity and densities 
could thus provide the biomass 
base for larger, better-known 
reef fish. —SNV 

Science, this issue p. 1189; 

see also p. 1128 


BIOCATALYSIS 
Light teaches 
(co)enzymes new tricks 


Light is widely used in organic 
synthesis to excite electrons 

in a substrate or catalyst, 
opening up reactive pathways 
to a desired product. Biology 
uses light sparingly in this way, 
but coenzymes such as flavin 
can be driven to excited states 
by light. Biegasiewicz et al. 
investigated this reactivity and 
found a suite of flavoenzymes 
that catalyze asymmetric radi- 
cal cyclization when exposed 
to light. “Ene”-reductases, 
when reduced and illuminated, 
converted starting materials 
containing an a-chloroamide 
and an alkene into five-, six-, 
seven-, or eight-membered 
lactams. Different enzymes 
furnished different stereo- 
chemistry in the products, likely 
because of changes in active- 
site pocket geometry. —MAF 


Science, this issue p. 1166 


STRUCTURAL BIOLOGY 
Keeping the gate open 


Cystic fibrosis is a progres- 
sive disease that affects lung 
function and is often fatal. It 

is caused by mutations in the 
cystic fibrosis transmembrane 
conductance regulator (CFTR). 
One class of mutants impairs 
ion conductance, and the drug 
ivacaftor acts by increasing the 
ion flux. Liu et al. describe high- 
resolution structures of CFTR 
bound to ivacaftor and to an 
investigational drug GLPG1837 
that also potentiates ion flow. 


SCIENCE sciencemag.org 


The two drugs bind at the same 
site in the transmembrane 
region. This site coincides with 
a hinge involved in channel gat- 
ing, suggesting that the drugs 
may stabilize the open confor- 
mation of the channel. —VV 
Science, this issue p. 1184 


ISLET TRANSPLANTATION 
Revascularizing eye-lets 


Pancreatic islet transplanta- 
tion is a potentially promising 
therapy for type 1 diabetes, 
but poor revascularization 
hinders islet long-term viabil- 
ity. Figueiredo et al. studied 
the role of protein tyrosine 
phosphatase 1B (PTP1B) in 
regulating islet vasculariza- 
tion. Islets from PTP1B~’- mice 
retained more endothelial 
cells during in vitro culture 
and restored normoglycemia 
when transplanted into the 
anterior chamber of the eye in 
diabetic mice. Future work is 
needed to determine whether 
targeting PTP1B can improve 
islet transplantation in human 
patients. —CC 

Sci. Transl. Med. 11, eaar6294 (2019). 


NEURODEVELOPMENT 
Rescued by a dietary 


supplement 
Patients with Rett-like syn- 
drome show impaired motor 
and cognitive development. 
Soto et al. studied this disorder 
ina 5-year-old patient with a 
mutation in an N-methyl-D- 
aspartate (NMDA) receptor 
subunit. The electrophysi- 
ological and morphological 
defects in neurons expressing 
this mutant were improved by 
the NMDA receptor agonist 
D-serine. D-Serine itself can be 
toxic, but it can be converted 
to its stereoisomer L-serine, a 
natural component of vari- 
ous foods. Supplementing the 
patient's diet with L-serine 
powder increased her D-serine 
evels and improved her motor 
and cognitive performance 
after about 1 year of treatment. 
—LKF 

Sci. Signal. 12, eaaw0936 (2019). 
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IMMUNOLOGY 


BeATing back obesity 


hildren who are breastfed appear to be at a lower risk for 
obesity later in life. One explanation is that breast milk 
may somehow prevent the premature transition from 
beige adipose tissue (BeAT) to white adipose tissue 
during childhood and adolescence. Yu et al. report that 
alkylglycerol ether lipids found in breast milk maintain the 
BeAT of infant mice. Adipose tissue macrophages metabolize 
these lipids into platelet-activating factor, which then induces 
macrophage release of interleukin-6, signal transducer and 
activator of transcription 3 (STAT3) signaling in adipocytes, 
and the development of BeAT. Although usually inactivated 
in adulthood, this pathway resurrects in obese adipose tis- 
sue, indicating a degree of metabolic flexibility that could be 
exploited for therapeutic applications. —STS 


J. Clin. Invest. 129, 2485 (2019). 


Lipids in breast milk delay the transition from beige to white adipose 


tissue and protect against obesity. 


INFORMATION STORAGE 
Storing information in 
molbytes 


Present technologies for storing 
information have been devel- 
oped over many decades, and 
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each has its own weaknesses 
and strengths. In this work, 
Cafferty et al. present a possible 
alternative storage approach 
that uses mixtures of small 
organic molecules (“molbytes,” 
oligopeptides in the present 
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study), distinguishable by mass 
spectrometry, to encode, write, 
store, and read any information 
in a binary manner. Each new 
message depends entirely on 
simple physical manipulations 
with the molecules, so no addi- 
tional synthesis is required. The 
proposed strategy is in its early 
development stage but dem- 
onstrates that the chemistry of 
small molecules can potentially 
offer alternative approaches for 
secure long-term, zero-energy 
storage of information. —YS 


ACS Cent. Sci. 5,911 (2019). 


AGING 
Senescent beta cells 


Removal of senescent cells, 
which accumulate during 
aging, may have clinical utility 
in managing chronic diseases. 
One chronic disease com- 
monly associated with aging 

is type 2 diabetes. Diabetes 
develops when pancreatic beta 
cells age and senesce, which 
then contributes to failure of 
glucose homeostasis. Aguayo- 
Mazzucato et al. investigated 
senescence of the insulin- 
producing beta cells in mouse 
models of diabetes. Removal of 
senescent cells with drug treat- 
ment or by specific depletion 
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of cells expressing the marker 
p16'*2 in transgenic mice 
helped restore beta-cell func- 
tion. Similar properties were 
observed in cultured human 
beta cells—results that hint of 
translational possibilities. -—LBR 
Cell Metab. 10.1016/ 
j-cmet.2019.05.006 (2019). 


NEUROSCIENCE 
Imagination as a dialogue 


The hippocampus constructs 
scene imagery to facilitate 
recollected or imagined mental 
representations. However, input 
from the ventromedial prefrontal 
cortex (vmPFC) is also needed 
for scene construction. How do 
these regions interact when we 
imagine a scene? Barry et al. 
addressed this question using 
magnetoencephalography. 
Participants imagined novel 
scenes or single objects after 
being given a cue. The direc- 
tion of information flow during 
scene imagination mirrored 
that observed during episodic 
memory retrieval, in which 
vmPFC drives hippocampal 
activity. These results indicate 
that the vmPFC selects the 
elements for a scene, whereas 
the hippocampus is necessary 
to construct the scene imagery. 
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The vmPFC strongly modulates 
the construction of spatially 
coherent, contextually appro- 
priate scene imagery. Episodic 
memory and imagination thus 
use similar regional dialogue in 
the brain. —PRS 


J. Neurosci. 39, 4375 (2019). 


CELL BIOLOGY 


QC keeps on keeping on 
Several protein quality control 
processes have been identi- 
fied that monitor and remove 
misfolded or damaged proteins 
in the endoplasmic reticulum. 
Schmidt et al. asked whether 
additional pathways moni- 

tor proteins further along the 
secretory pathway in the Golgi 
complex and the endosome. 
They identified a degradation 
pathway in budding yeast cells 
associated with endosome and 
Golgi, which they named EGAD. 
EGAD extracts membrane 
proteins from the Golgi and 
endosomes, allowing them to 
become substrates for cytosolic 
proteasomal degradation. An 
important cellular substrate 

of this pathway is a protein 
called Orm2, which is involved 
in down-regulating sphingolipid 
biosynthesis. The selective deg- 
radation of Orm2 facilitates the 
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PSYCHOLOGY 


Social media use and 
mental health 


here has been public concern that 
increasing social media use among ado- 
lescents may cause reduced well-being. 
Orben et al. analyzed social media use 
among more than 12,000 teenagers 
in the United Kingdom over the course of 8 
years to determine whether increased social 
media use predicted reduced life satisfaction 
over time. They found virtually no effect of 
social media use on life satisfaction either 
between individuals or in the same individual 
across time. These results suggest that 
concern over the use of social media and its 
relationship to mental health may be unwar- 
ranted. —TSR 
Proc. Natl. Acad. Sci. U.S.A. 116, 10226 (2019). 


Adolescents using social media on their smartphones 


homeostatic regulation of sphin- 
golipid biosynthesis. —SMH 

EMBO. 10.15252/ 

embj.2018101433 (2019). 


NANOMATERIALS 
Enhancing nanodiamond 
sensors 


The fluorescence and spin 
properties of nitrogen vacancy 
(NV) centers in nanodiamonds 
could make them useful sensors 
when linked to biomolecules. 
Although nanodiamonds with 
diameters of about 5 nanome- 
ters can be extracted from the 
products of closed-chamber 
detonation of explosives like 
trinitrotoluene, the number of 
spin-triplet (negatively charged) 
NV centers can be too low to 
ensure that each nanodiamond 
is active. Terada et al. show 
that irradiation of detonation 
nanodiamonds with a sufficient 
fluence of 2 mega-—electron 
volt electrons can increase the 
number of negatively charged 
NV centers by a factor of 4. The 
irradiation process aggregated 
the nanodiamonds, but they 
were redispersed with boiling 
oxidizing acid. —PDS 
ACS Nano 10.1021/ 
acsnano.8b09383 (2019). 
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ROBOTICS 
Hand it to you 


Our ability to grab, hold, and 
manipulate objects involves our 
dexterous hands, our sense of 
touch, and feedback from our 
eyes and muscles that allows 
us to maintain a controlled 
grip. Billard and Kragic review 
the progress made in robot- 
ics to emulate these functions. 
Systems have developed from 
simple, pinching grippers 
operating ina fully defined 
environment, to robots that can 
identify, select, and manipulate 
objects from a random collec- 
tion. Further developments are 
emerging from advances in 
computer vision, computer pro- 
cessing capabilities, and tactile 
materials that give feedback to 
the robot. -MSL 

Science, this issue p. 1149 


RUMINANT GENOMICS 
Phylogeny and 
characteristics of 
ruminants 


Ruminants are a diverse group of 
mammals that includes fami- 
lies containing well-known taxa 
such as deer, cows, and goats. 
However, their evolutionary 
relationships have been conten- 
tious, as have the origins of their 
distinctive digestive systems and 
headgear, including antlers and 
horns (see the Perspective by 
Ker and Yang). To understand the 
relationships among ruminants, 
L. Chen et al. sequenced 44 spe- 
cies representing 6 families and 
performed a phylogenetic analy- 
sis. From this analysis, they were 
able to resolve the phylogeny 

of many genera and document 
incomplete lineage sorting among 
major clades. Interestingly, they 
found evidence for large popula- 
tion reductions among many 
taxa starting at approximately 
100,000 years ago, coinciding 
with the migration of humans 
out of Africa. Examining the bony 
appendages on the head—the 
so-called headgear—Wang et al. 
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describe specific evolutionary 
changes in the ruminants and 
identify selection on cancer- 
related genes that may function 
in antler development in deer. 
Finally, Lin et al. take a close 
look at the reindeer genome 
and identify the genetic basis of 
adaptations that allow reindeer to 
survive in the harsh conditions of 
the Arctic. —LMZ 
Science, this issue p. 1152, 
p. 1153, p. 1154; see also p. 1130 


MOLECULAR MACHINES 
Flexible domains 
in a well-oiled machine 


Motors convert one form 
of energy into another. For 
biological motors, adenosine 
triphosphate (ATP) serves as 
chemical energy and its hydroly- 
sis is coupled to conformational 
changes that exert mechanical 
force. ATP synthases reverse 
this process in a multistep 
process: first converting an 
electrochemical gradient to rota- 
tional kinetic energy, and then 
coupling rotation to formation 
of high-energy phosphodiester 
bonds. Murphy et al. investigated 
these energy changes in the 
dimeric mitochondrial F,-F, ATP 
synthase from Polytomella sp., 
a unicellular alga. They solved 
high-resolution cryo—electron 
microscopy structures of the 
ATP synthase complex, extract- 
ing 13 rotational substates. This 
collection of structures revealed 
that the rotation of the F, ring 
and central stalk is coupled with 
partial rotations of the F, head. 
This flexibility may enable the 
head to better couple continu- 
ous rotation with discrete ATP 
synthesis events. —MAF 

Science, this issue p. 1155 


FISHERIES 
Asmall, interconnected 
world 


Countries manage their fisheries 
as if they were a local resource. 
To some degree, this may reflect 
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reality, but marine fish, perhaps 
more than any other vertebrate 
group, are connected across 
large distances through ocean 
currents. Ramesh et al. model 
how these currents distribute 
the fish larvae of more than 700 
species. They used network 
analysis to assess the degree to 
which populations found in one 
part of the world may have come 
from another. It seems that 
global fish populations represent 
a small-world network where 
connections across populations 
are tight and particular hubs of 
productivity are widely impor- 
tant. Such connectivity has 
wide-ranging implications for 
conservation, management, and 
food supplies globally. —SNV 
Science, this issue p. 1192 


MUCOSAL IMMUNITY 
Context shapes 
anticommensal immunity 


The gut bacterium Akkermansia 
muciniphila is associated 
with protection from obesity, 
enhanced wound healing, 
and augmented antitumor 
responses. Ansaldo et al. found 
that this microbe induces 
antigen-specific immunoglobulin 
Gl (lgG1) antibodies generated 
by B cells with CD4* T cell help. 
This is in contrast to most anti- 
commensal responses, which 
involve the T cell-independent 
production of IgA antibodies. In 
a gnotobiotic setting in which all 
components of the microbiome 
are defined, A. muciniphila—spe- 
cific T cells expanded only when 
A. muciniphila was present. 
The T cells primarily displayed 
a phenotype associated with 
B cell help. However, in mice 
with a conventional gut micro- 
biota, other proinflammatory 
A. muciniphila—specific T cell 
populations also expanded. 
Thus, anti-A. muciniphila immu- 
nity is context dependent, which 
may explain the variable immune 
responses to this microbe 
reported in patients. —STS 
Science, this issue p. 1179 
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CONSERVATION 
Integrating knowledge 


systems for wildlife health 
Knowledge held by local or 
indigenous populations is often 
crucial for supporting wildlife 
health and conservation efforts. 
In a Perspective, Kutz and 
Tomaselli highlight efforts to 
integrate these different forms of 
knowledge in a mutually respect- 
ful manner. For example, use of 

a participatory epidemiology 
approach that integrates scien- 
tific and indigenous knowledge 
was key to eradicating rinderpest 
in its last foci in East Africa. A 
similar approach led to a greater 
understanding of the mecha- 
nisms driving muskox population 
declines in the Canadian Arctic. 
Thus, going forward, rigorous 
research design that is based on 
ongoing iteration and feedback 
could enable inclusion of qualita- 
tive local indigenous knowledge 
in models and decision-making. 
—JFU 


Science, this issue p. 1135 


COLLOIDAL MATERIALS 
Mobile particles in 
colloidal crystals 


The crystallization of nanopar- 
ticles can be controlled by 
functionalizing them with DNA 
strands that direct assembly 
through hybridization. The 
design rules for interactions 
between pairs of particles 
resemble those for ionic com- 
pounds. Inspired by molecular 
dynamics simulations, Girard 
et al. show that larger particles 
(~10 nanometers in diameter) 
that have mutual repulsive inter- 
actions can form a stable lattice 
only if much smaller conjugate 
particles (~1.5 nanometers in 
diameter) are present. These 
smaller particles are mobile 
and diffuse through the lattice, 
so the bonding interaction 
resembles the classical picture 
of electrons in metals. —PDS 
Science, this issue p. 1174 
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CANCER 
The microbiota in 
colorectal cancer 


Several types of bacteria have 
been associated with colorectal 
cancer, but do these bacteria 
have a role in tumorigenesis? 
Data remain unclear, but evi- 
dence suggests that some types 
of bacteria can influence tumor 
progression and responses to 
therapy. In a Perspective, Garrett 
discusses how to advance micro- 
biota research to potentially 
improve prevention, diagnostics, 
and therapy of colorectal cancer. 
—GKA 

Science, this issue p. 1133 


INFLAMMATION 
Acolitis circuit 


Cytokines are known to play a 
critical role in maintaining gut 
homeostasis, but their spe- 
cific cellular sources are less 
well understood. Bernshtein 
et al. used a murine model of 
inflammatory bowel disease 
in which macrophages specifi- 
cally lacked expression of the 
interleukin-10 receptor (IL-10R). 
The mice developed symptoms 
of spontaneous colitis similar 
to those observed in children 
with IL-LOR mutations. The 
macrophage-derived cytokine 
IL-23 was critical for inducing 
the pathology. IL-23 triggered 
accumulation of |L-22—produc- 
ing T helper 17 cells that, in 
turn, promoted production of 
chemokines by colonic epithelial 
cells and destructive neutrophil 
recruitment. —CNF 

Sci. Immunol. 4, eaau6571 (2019). 


HYDROLOGY 
Modeling groundwater 
depletion 


Understanding the full effects of 
human activities on groundwater 
resources has been a persis- 
tent challenge to hydrologic 
models. Such models often 
seek to inform best practices 
for resource management. One 
important gap in our under- 
standing is how groundwater 
depletion affects surface water 
behavior. Condon and Maxwell 


SCIENCE sciencemag.org 


simulated the impact of 100 
years of groundwater declines 
across the continental United 
States. Their integrated model 
illuminates the nature of con- 
nections between groundwater 
pumping and changes in natural 
watersheds, including the 
widespread impacts of pumping 
on evapotranspiration rates and 
streamflow. —KVH 
Sci. Adv. 10.1126/ 
sciadv.aav4574 (2019). 
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BACKGROUND: Humans have a fantastic abi- 
lity to manipulate objects of various shapes, 
sizes, and materials and can control the ob- 
jects’ position in confined spaces with the 
advanced dexterity capabilities of our hands. 
Building machines inspired by human hands, 
with the functionality to autonomously pick 
up and manipulate objects, has always been 
an essential component of robotics. The first 
robot manipulators date back to the 1960s and 
are some of the first robotic devices ever con- 
structed. In these early days, robotic manipula- 
tion consisted of carefully prescribed movement 
sequences that a robot would execute with no 
ability to adapt to a changing environment. As 
time passed, robots gradually gained the abil- 
ity to automatically generate movement se- 
quences, drawing on artificial intelligence and 
automated reasoning. Robots would stack boxes 
according to size, weight, and so forth, extending 
beyond geometric reasoning. This task also re- 
quired robots to handle errors and uncertainty 
in sensing at run time, given that the slightest 
imprecision in the position and orientation of 
stacked boxes might cause the entire tower to 
topple. Methods from control theory also be- 
came instrumental for enabling robots to com- 
ply with the environment’s natural uncertainty 
by empowering them to adapt exerted forces 
upon contact. The ability to stably vary forces 
upon contact expanded robots’ manipulation 
repertoire to more-complex tasks, such as in- 
serting pegs in holes or hammering. However, 
none of these actions truly demonstrated fine 
or in-hand manipulation capabilities, and they 
were commonly performed using simple two- 
fingered grippers. To enable multipurpose fine 
manipulation, roboticists focused their efforts 
on designing humanlike hands capable of using 
tools. Wielding a tool in-hand became a prob- 
lem of its own, and a variety of advanced 
algorithms were developed to facilitate stable 
holding of objects and provide optimality 
guarantees. Because optimality was difficult 
to achieve in a stochastic environment, from 
the 1990s onward researchers aimed to increase 
the robustness of object manipulation at all 
levels. These efforts initiated the design of 
sensors and hardware for improved control 
of hand-object contacts. Studies that followed 
were focused on robust perception for coping 
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with object occlusion and noisy measurements, 
as well as on adaptive control approaches to 
infer an object’s physical properties, so as to 
handle objects whose properties are unknown 
or change as a result of manipulation. 


ADVANCES: Roboticists are still working to 
develop robots capable of sorting and pack- 
aging objects, chopping vegetables, and folding 
clothes in unstructured and dynamic environ- 
ments. Robots used for modern manufactur- 
ing have accomplished some of these tasks in 


Holding two objects in one hand requires 
dexterity. Whereas a human can grab 
multiple objects at the same time (top), a 
robot (bottom) cannot yet achieve such 
dexterity. In this example, a human has placed 
the objects in the robot's hand. 


structured settings that still require fences be- 
tween the robots and human operators to ensure 
safety. Ideally, robots should be able to work 
side by side with humans, offering their strength 
to carry heavy loads while presenting no danger. 
Over the past decade, robots have gained new 
levels of dexterity. This enhancement is due to 
breakthroughs in mechanics with sensors for 
perceiving touch along a robot’s body and new 
mechanics for soft actuation to offer natural 
compliance. Most notably, this development 
leverages the immense progress in machine 


learning to encapsulate models of uncertainty 
and support further advances in adaptive and 
robust control. Learning to manipulate in real- 
world settings is costly in terms of both time 
and hardware. To further elaborate on data- 
driven methods but avoid generating examples 

with real, physical systems, 
many researchers use sim- 
Read the full article ancien ener eell 
at http://dx.doi. grasping and dexterous 
org/10.1126/ manipulation require a 
science.aat8414 level of reality that exist- 
ing simulators are not yet 
able to deliver—for example, in the case of 
modeling contacts for soft and deformable 
objects. Two roads are hence pursued: The 
first draws inspiration from the way humans 
acquire interaction skills and prompts robots 
to learn skills from observing humans per- 
forming complex manipulation. This allows 
robots to acquire manipulation capabilities in 
only a few trials. However, generalizing the 
acquired knowledge to apply to actions that 
differ from those previously demonstrated re- 
mains difficult. The second road constructs 
databases of real object manipulation, with 
the goal to better inform the simulators and 
generate examples that are as realistic as pos- 
sible. Yet achieving realistic simulation of 
friction, material deformation, and other phys- 
ical properties may not be possible anytime 
soon, and real experimental evaluation will be 
unavoidable for learning to manipulate highly 
deformable objects. 


OUTLOOK: Despite many years of software 
and hardware development, achieving dexter- 
ous manipulation capabilities in robots remains 
an open problem—albeit an interesting one, 
given that it necessitates improved understand- 
ing of human grasping and manipulation 
techniques. We build robots to automate tasks 
but also to provide tools for humans to easily 
perform repetitive and dangerous tasks while 
avoiding harm. Achieving robust and flexible 
collaboration between humans and robots is 
hence the next major challenge. Fences that 
currently separate humans from robots will 
gradually disappear, and robots will start manip- 
ulating objects jointly with humans. To achieve 
this objective, robots must become smooth 
and trustable partners that interpret humans’ 
intentions and respond accordingly. Further- 
more, robots must acquire a better understand- 
ing of how humans interact and must attain 
real-time adaptation capabilities. There is also a 
need to develop robots that are safe by design, 
with an emphasis on soft and lightweight struc- 
tures as well as control and planning method- 
ologies based on multisensory feedback. 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: aude.billard@epfl.ch 
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Dexterous manipulation is one of the primary goals in robotics. Robots with this capability 
could sort and package objects, chop vegetables, and fold clothes. As robots come to 
work side by side with humans, they must also become human-aware. Over the past 
decade, research has made strides toward these goals. Progress has come from advances 
in visual and haptic perception and in mechanics in the form of soft actuators that offer 
a natural compliance. Most notably, immense progress in machine learning has been 
leveraged to encapsulate models of uncertainty and to support improvements in adaptive 
and robust control. Open questions remain in terms of how to enable robots to deal 

with the most unpredictable agent of all, the human. 


ave you ever found yourself busy forag- 
ing in your bag in search of a set of keys? If 
so, you may recall that it took only a few 
seconds to find them among the disparate 
contents of the bag. For certain, you did 
not reflect on your abilities and may have carried 
on this display of unique dexterity through a 
swift in-hand manipulation, taking out the cor- 
rect key and inserting it into the lock even 
though the corridor lights had gone out. All day 
long, our fingers grasp, move, and transform 
objects and interact with objects in various media 
such as air, water, and oil. We do not spend time 
thinking about what our hands and fingers are 
doing or how the continuous integration of var- 
ious sensory modalities—such as vision, touch, 
proprioception, and hearing—help us outperform 
any other biological system in the breadth of the 
interaction tasks we can execute. Largely over- 
looked, and perhaps most fascinating, is the ease 
with which we perform these interactions, re- 
sulting in a belief that they are also easy to ac- 
complish in artificial systems such as robots. 
Manipulating objects is such a ubiquitous ac- 
tivity that we forget how difficult it was to ac- 
quire this competence as a child. Children are 
born with simple grasp reflexes. It takes them 
3 years to develop an individuated control of 
each finger and another 6 years to display an 
adult-equivalent ability for making smooth con- 
tact and for planning sequences of manipulation 
skills (2). Even for humans, some dexterous activ- 
ities may pose a challenge. For example, tying 
shoes may be done in various ways, and there 
may be several valid models of how to execute 
such an activity. In addition, we can visually dem- 
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onstrate how to do something and what the 
expected result may be, but we cannot easily 
communicate the magnitude of the applied 
forces and torques or the size of the friction co- 
efficient necessary to satisfy stability conditions. 
Still, we find ways of achieving manipulation 
goals through training and exploration even if 
the end result is not always optimally performed. 
We may also adapt as circumstances dictate (e.g., 
tying shoes with an excess of free shoelace or 
when the ends are quite short), forcing us to 
deviate from our normal methods. Thus, the 
context in which interactions are performed af- 
fects various parameters of the execution. 
Although robotics has made vast progress in 
mechanical design, perception, and robust con- 
trol targeted to grasping and handling objects, 
robotic manipulation is still a poor proxy for hu- 
man dexterity. To date, no robots can easily hand- 
wash dishes, button a shirt, or peel a potato. 


What can robots do today? 


Robots are skilled at picking up and manipu- 
lating objects in repetitive and familiar settings 
such as industrial assembly setups. In such set- 
tings, the geometry, material properties, and 
weight of the objects are commonly known. 
Robots can handle some variation in routine 
movements in terms of adapting to small differ- 
ences in the object properties, but the whole pro- 
cess is typically optimized to a limited set of 
expected variations. In early factory settings, 
robot arms followed predetermined trajectories 
and assumed that objects would always appear 
at the same place. Today, robots can adapt their 
trajectory to retrieve objects at different loca- 
tions, making it possible for objects to be placed 
by humans or simply dropped on a conveyer belt 
instead of being deposited at exact positions by 
other machines. The classical assembly lines in 
which robots were bolted into the floor and placed 
one after another, typical for the automobile in- 
dustry, can now be made more flexible. Objects 
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moving on conveyers can be detected fairly easily 
by cameras and picked up if fully visible. How- 
ever, detection of transparent objects or objects 
partially hidden (e.g., when stacked on top of one 
another) remains difficult. 

With the need to frequently change the type 
of goods produced, the robotics industry strives 
for multipurpose object grasping and handling 
solutions. One step toward this objective is to 
provide robots with a choice of grippers varying 
in size and strength and to enable robots with 
tool-changing mechanisms so that they can select 
the correct tool. To determine which tool to use 
for a given task, a robot must have knowledge of 
an object’s properties, such as shape, weight, ma- 
terial, and so forth. This information is readily 
available in factories where all objects are known. 
However, this requirement presents a limitation 
for robots in other settings, where the set of objects 
to be manipulated may not be known beforehand. 


What can robots not do today? 


Although robots are adept at handling rigid 
objects, they still struggle with flexible materials— 
such as fruits and vegetables or clothing items— 
that differ in size, weight, and surface proper- 
ties. Manipulations that produce a deformation 
(e.g., inserting, cutting, or bending) are particu- 
larly difficult, as accurate models of the defor- 
mations are needed. Industrial grippers often 
use pneumatic vacuum pumps to pick up objects 
by sucking. This technique is unbeatable when 
it comes to grasping an object but is much less 
useful for object manipulation (e.g., reorienting 
the object and placing it in a confined space). 
One step to address this challenge is to provide 
robots with more dexterous hands. Yet creating 
hands as dexterous as human hands is difficult, 
owing to a lack of sensors and actuators equiv- 
alent in size, precision, and efficiency to our 
skin and muscles. 

Improvements in robots’ dexterity are not lim- 
ited to the engineering of more-capable hands. 
Advanced software programs are required to 
analyze in real time the large flux of visual, tac- 
tile, and force information and to relate these 
different senses to recognize objects and model 
their transformations. Additionally, robots need 
advanced cognitive capabilities to predict where, 
how, and why to manipulate objects. The rest 
of this Review describes why overcoming these 
challenges is difficult and where the field of 
robotics stands today. 


Why is designing robotic 
hands difficult? 


Although research on robot hands has been on- 
going for more than five decades (2-4), the most 
common hand used in many applications to date 
is still a parallel jaw gripper, usually without any 
extra sensing. Picking up objects with a gripper 
devoid of sensing is akin to grasping with the 
tip of your thumb and index finger when both 
are numb! This tool may suffice for simple pick- 
and-place actions, but not for more-complex 
motions such as shuffling keys. Because the 
human hand performs intricate movements with 
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Fig. 1. Soft hands. Robotic hands are traditionally made of hard materials with rigid control of fingers. Recent designs aim to mimic the human hand's 
natural compliance by using soft actuators, soft materials, and advanced controllers. (A) Rigid material and actuators and (B) a rigid cover with 
partially soft cable-driven actuation: Both hands become soft through software intervention, modulating pressure at the fingertips via tactile feedback. 
[Reproduced from (17, 33)] (C) A soft, foldable gripper that can adapt shape and stiffness. [Reproduced from (11)] (D) Soft actuation and material 
for a rehabilitation glove that can be worn by a human. [Reproduced from (12)] 


ease, it is a natural inspiration for robotics. But 
designing robotic hands with sensors and ac- 
tuators similar to those of the human hand is 
difficult for many reasons. 

When constructing anthropomorphic robot 
hands, it is challenging to fit all of the necessary 
actuators, sensors, and mechanical structure in 
the limited available space. Another obstacle is 
to keep the total weight of the hand low so that 
it satisfies the payload requirements of the 
arm to which it is attached. Hence, compared 
with human hands, most anthropomorphic 
hands and prostheses do not have nearly as 
many controllable degrees of freedom (5, 6). 

The human hand is soft and flexible, with a 
dexterous thumb whose distinctive range of mo- 
tion remains difficult to replicate mechanically 
(7), as the intricate combinations of tendons and 
muscles differ markedly from traditional serial 
robotic joint design (8). Today, robotic hands are 
still largely composed of rigid plastic and metal 
components, with electric motors as actuators. 
This rigidity is partially the cause of the lack of 
dexterity, as it allows no room for mistakes when 
executing grasps. Rigid fingers closing on an ob- 
ject may easily move rather than grasp an object 
if its pose is not perfectly estimated, and apply- 
ing too much force may crush the object. A grow- 
ing trend in robotics is the development of soft 
hands that can conform to an object’s shape, 
absorb unexpected forces at contact, and com- 
pensate for load change during manipulation 
(9, 10). 

Softness can be achieved through a change in 
hardware or software or a combination of both 
(Fig. 1). Softness from material used to construct 
hands builds on solutions from 3D manufac- 
turing and materials science. For instance, one 
can manufacture rigid and flexible materials in 
a layer-by-layer manner to create foldable fingers 
that can deploy and retract as needed (11). Cur- 
rently, the low payload and slow speed of these 
elastomers restricts manipulation to light objects 
only. As an alternative to generate more power, 
pneumatic or hydraulic actuation may be used 
(22, 13). 

Human hands are covered with a multipurpose 
skin that provides the appropriate level of friction 
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Fig. 2. Sense of touch for robots. (Left) Vision can be used to infer contact forces (red). 
[Reproduced from (21)] (Right) Stretchable artificial skin measures contact at knuckles, which may 
be useful for exploring internal parts of objects. [Reproduced from (18)] 


and damping. Human skin is a high-frequency 
and high-resolution sensor that provides precise 
information on normal and tangential forces, 
information that is critical for grip adjustment. 
Human skin can also measure stretch and tem- 
perature. By contrast, robot hands typically mea- 
sure exerted forces through miniature force 
sensors placed solely at the fingertips (74). Force 
sensors yield very accurate 3D measurements, 
but they cannot easily reveal the exact location 
of contact. To move objects once held in the hand 
or to hold multiple objects at once (Fig. 1), one 
needs to measure precise contact points, not just 
at the fingertips but also along the length and 
side of the fingers and inside the palm. This can 
be achieved through artificial skins that provide 
contact measurements all along the limbs. In- 
terest in artificial skins can be traced back to the 
1980s (15, 16), but major advances were achieved 
in the past decade. 

At present, we find a variety of affordable 
commercial products, several of which can be 
customized to a robot’s shape. Touch sensors 
measure the normal contact force; a few also 
provide data on tangential forces, torque, tem- 
perature, vibrations, or surface properties. Never- 
theless, most touch sensors are rigid, and their 
placement is constrained to fingertips and along 
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limb segments. Touch detection at the joint 
(knuckle, elbow, knee) is, however, crucial to de- 
tect entrapment. It is also useful to guide explo- 
ration inside objects (17). Such contact can be 
detected only by soft sensors that bend and ex- 
tend along the flexion and extension points of 
the limb (Fig. 2, right) (18). Hence, flexible and 
stretchable skins are of utmost interest to robot- 
icists (19). Prototypes exist in laboratories, and 
we can expect to see their deployment soon, 
given the current interest in soft electronics (20). 

As an alternative to using skin, it is possible 
to deduce haptic (contact and force) information 
from vision. One can, for instance, infer forces 
from vision through a dynamic model of contacts 
(21) (Fig. 2, left) or use an optical sensor that 
renders deformation of an object’s geometry at 
a high spatial resolution (22). The necessity of 
estimating the exact position of an object, its 
local geometry, and other properties such as 
weight and weight distribution depends strongly 
on the application. It is the interplay among 
hand design, material, and internal and external 
sensing that offers the appropriate redundancy. 
One additional challenge is the need to mea- 
sure contact at a very high frequency for accu- 
rate and timely detection of slip (23). Such high 
spatial and temporal resolution, together with 
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the real-time processing of vision data for ob- 
ject tracking, leads to a computational overload 
with a massive data stream that must be inter- 
preted in real time. This processing is usually 
carried out by a CPU (central processing unit) 
located away from the hand. Alternatively, pro- 
cessing may be performed on the hand itself 
through the use of a dedicated CPU (24), but 
such CPUs have focused solely on processing 
vision data. Additional studies are needed to 
develop hardware for processing tactile infor- 
mation in conjunction with vision. 

Dexterous robot hands may hence be realized 
by using research in materials science for the 
design of soft actuation, enabling contact sensing 
along the entire surface of the hand, and by using 
advances in electronics for onboard, real-time 
processing of multisensory data. 


Design beyond anthropomorphism 


Although the human hand is fascinating, it does 
not have to be the ultimate solution for robotics. 
A human hand design may be desirable for aes- 
thetic reasons; for instance, when designing hand 
prostheses or humanoid robots. But this same 
design may be superfluous for many robots. In- 
dustrial hands remain a good solution for spe- 
cific tasks. Rather than try to replicate the 
positioning of human fingers, these hands have 
two or three fingers arranged symmetrically 
around the palm, a design particularly suited 
for industrial screwing. 

Robotics keeps oscillating between anthropo- 
morphic and traditional industrial designs for 
hands. But the gripping systems of simpler ani- 
mals may also provide inspiration. For instance, 
fish suck in their prey. Adding suction at robots’ 
fingertips is useful under water, as this technol- 
ogy cancels the flow generated by the hand (25). 

Why not create hands that both leverage and 
go beyond nature? For instance, the human 
thumb is amazing, but it creates an asymmetry 
that constrains the orientation of the hand for 
manipulation. Two thumbs on the same hand, 
however, would provide a dexterity beyond 
human capability (Fig. 3). 


Desiderata for the next generation 
of robotic hands 


The objects around us have been built and 
adapted to our hands, which are still rather 
small and very robust in comparison with con- 
temporary robot hands. Enabling robots to pick 
up small items such as pens, raisins, screws, and 
needles is a clear functionality goal. Today, ro- 
botic arms and hands are commonly developed 
separately, and integrating them is an engineer- 
ing job of its own. Industrial arms have sub- 
stantial payloads but are commonly designed to 
be bolted into the floor and are too large to be 
deployed outside industrial settings. The arms 
of humanoid robots and robots intended for 
fine assembly tasks have low payloads, which 
are typically not sufficient to carry a hand and 
an object held by the hand. Adding sensing 
functionality to arms and hands requires cab- 
ling that can quickly become complicated. Fur- 
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thermore, many hands come with no or limited 
means of measuring contact and forces. Thus, 
a change in paradigm is needed to move away 
from developing robotic arms devoid of hands 
and hands devoid of arms. We must further 
ensure that hands are developed in a “plug- 
and-play” manner and can easily be attached 
and detached through existing tool-switching 
systems. State-of-the-art force and tactile sen- 
sors must become an inherent part of the arm- 
hand system. 

Robot dexterity is as much a by-product of 
advances in hardware as it is of advances in soft- 
ware. It requires suitable algorithms to rapidly 
and efficiently process the vast amount of in- 
formation collected through sensors and actua- 
tors. At the same time, it needs algorithms to 
adequately control the movement of a hand in 
relation to object, scene, and task properties. We 
next review advances in perception, control, and 
learning for manipulation. 


Perception for manipulation 


As for humans, robot perception for manipu- 
lation is multimodal (Fig. 4). Vision is instru- 
mental for recognizing and localizing objects. 
When associated with a database of existing 


objects, robot vision can help infer geometric 
and physical properties of known and even 
unknown objects (26), and this information is 
important for shaping the aperture of the hand 
and the forces to be applied. Proprioception— 
namely, knowledge of where the robot’s limbs 
are located—is needed to guide the arm and hand 
toward the object, with visual support to contin- 
uously track the object. Touch and force mea- 
surements become important once contact has 
occurred and the object is held or explored by the 
hand. The associated control algorithms are used 
to guide the grasp and/or to infer the object’s 
physical properties, such as rigidity and mass 
distribution, that may have been poorly estimated 
or unknown previously. Sound has also received 
attention recently as a means to infer an invisible 
object’s content and to monitor changes in 
content during manipulation (27). 

As an example, a robot is tasked with fetching 
a package of milk from a refrigerator. Before the 
robot holds the package in hand, it may not 
know how much milk the package contains nor 
the package’s actual weight. Given that the 
package may be made of cardboard, the robot 
needs to know the weight in order to apply a 
suitable grasping force and avoid destroying 


Fig. 3. Designing hands beyond human dexterity. Two thumbs would make it possible to execute 
screwing and unscrewing motions with one hand rather than two. This capability may be useful 
for robots and humans via prostheses. [Illustrations: Laura Cohen] 


Fig. 4. Manipulation is 
multimodal. Vision is used 
before contact, whereas 

haptics and sound are 

involved upon contact to 
estimate an object's 

physical properties that 

cannot be directly observed. 
[Photo: Learning Algorithms and 
Systems Laboratory, EPFL] 


21 June 2019 


3 of 8 


RESEARCH | REVIEW 


the package. In the case of milk, sound may 
also provide information about the viscosity 
when the package is shaken, as milk will sound 
different from another substance, such as yogurt. 

Over the past few years, major efforts have 
been undertaken to analyze visual information, 
and progress has been considerable. Neverthe- 
less, robots still struggle to recognize objects that 
are partially occluded (28), particularly when 
viewed from a moving camera or when an object 
moves in a robot’s hands (29). In comparison to 
developing vision algorithms, much less effort 
has been devoted to analyzing haptic informa- 
tion, given that solutions for covering entire 
hands with haptic sensors are still lacking. To- 
day, visual and haptic information are still used 
primarily in a sequential manner [e.g., with vis- 
ual information provided in the preparation 
phase and haptic data provided upon contact 
(30)], and only a few recent works integrate both 
modalities for recognition, grasping, in-hand 
adaptation, and shape reconstruction (31-35). 
In comparison, humans are proficient at alter- 
nating between different senses, from vision to 
touch and back, and can do so rapidly even if 
these senses change in processing frequency. 
By contrast, robots still lack the ability to decide 
what sensors to use, when to use them, and 
when to switch between sensors. 


Grasping: A stepping stone 


Before a robot can manipulate an object in hand, 
it must be able to grasp its fingers around the 
object. If grasping is conceptualized only as get- 
ting fingers around an object with no additional 
constraints considered, the challenge of grasping 
may appear to be solved. However, grasping an 
object is a far more daunting problem. For dec- 
ades, researchers have worked to establish the 
theory of how to form a stable grasp. This be- 
came an intricate mathematical exercise aimed 
at determining the minimal number and optimal 
positions of the fingertips on the object’s surface 
to ensure stability (36). 

Although it is valuable, most of this theoretical 
work relies on assumptions such as a known 3D 
model of the object, a rigid point contact, and no 
uncertainty in the process. To incorporate uncer- 
tainty originating from imperfect object models 
and dynamics in the interaction process, we 
must go beyond modeling a single point contact 
and pursue substantial advances in the basic 
theory. 

Thus, many of the more recent approaches 
are data driven (37). To avoid computing an op- 
timal grasp each time a robot encounters an 
object, one can build a database of grasps and 
employ methodologies for sampling and rank- 
ing candidate grasps in real time. This approach 
deals with uncertainty in perception and pro- 
vides fast and online generation of grasps for 
known, familiar, and even unknown objects. Prior 
knowledge of object properties determines the 
necessary perceptual processing and associated 
object representations for generating and rank- 
ing grasp candidates. Although this method 
works well for known and familiar objects, un- 
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known objects necessitate additional heuristics 
for the discovery of geometric structures (e.g., 
handles, for which a robot would have a can- 
didate grasp). This challenge is closely related to 
the classical problems of instance recognition 
and categorization in computer vision, but the 
notion that grasping is not an isolated process 
adds a new dimension. 

In addition to being object dependent, grasps 
are also robot dependent. Moreover, as the num- 
ber of degrees of freedom of the hand increases, 
so does the complexity of the control. This is 
particularly an issue for anthropomorphic hands. 
One avenue of research to simplify the control 
draws inspiration from biology and promotes 
the use of postural synergies (38). Synergies form 
a basis of the subspace of effective human move- 
ments in relation to those that are possible by the 
kinematics of the body. These have been used as 
a tool for robot hand analysis, control, and de- 
sign choice (39-42). Several studies have also 
demonstrated how underactuated hands can 
be leveraged to grasp and manipulate objects in 
unstructured environments and how this work 
may lead to adaptive hands that are relatively 
cheap, lightweight, and easy to control compared 
with fully actuated hands (43-48). More recent 
work has optimized hand design to improve 
manipulation capabilities (49, 50), providing 
open-source software for such design. Other 
recent work has suggested that the ability of 
compliant hands to deform in and with the 
environment may reduce the cognitive load of 
manipulation (57). Furthermore, this idea can 
be studied systematically using morphological 
computation (52), in which compliant interac- 
tions allow adaptation of behavior to a particular 
context, without the need for explicit control. 


From grasping to manipulation 


Grasping is not an end on its own; it is also rel- 
ated to the task a human or robot is executing. 
For example, one grasps a cup differently de- 
pending on whether the goal is to drink from it, 


Fig. 5. Grasp functionality. (Top) A 
human grasps an item differently 
depending on whether the aim is to 
hold it, open the cap, or hand it to 
someone else. (Bottom) Robots can 
also be programmed to hold the 
same glass differently depending on 
whether they are tasked with 
handing it to a human or pouring out 
its contents. [Photos: Learning 
Algorithms and Systems 
Laboratory, EPFL] 
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fill it with fluid, put it in a dishwasher, or serve 
it to another person (53) (Fig. 5). Similarly, al- 
though a knife, fork, or spoon may be held with 
the same grasp when used to mix soup, this 
grasp differs from those employed when these 
utensils are used for eating or cutting. To de- 
termine the optimal way to grasp an object, 
one must understand the purpose of the grasp. 
Hence, while roboticists aimed to solve the prob- 
lem of how to grasp an object, they first had to 
identify the reason for executing the grasp. To- 
day, researchers consider grasping as part of an 
overall plan for object manipulation. 

To determine the correct grasp to use with 
the correct tool, one must first have the correct 
tool at their disposal. When in need of a hammer 
but no hammer is in reach, a human will instead 
select the first object sturdy enough to act as a 
hammer. Future efforts to develop robots that 
can reason in this manner when the most ap- 
propriate tool is not available will be critical to 
facilitate deployment of robots in natural envi- 
ronments. Additionally, robots with this capabil- 
ity will be able to use tools originally designed 
for human dexterity to perform household tasks 
without making undesired modifications to our 
households. How to program such “common 
sense” tool use is hence an important avenue 
for research, and some initial work has been 
conducted in this direction (54-57). 


Manipulations that remain difficult 


The previous sections detail the many problems 
that remain to be solved before robots can per- 
form grasps with a human level of intelligence. 
This said, robots are already fairly efficient at 
grasping and releasing certain types of objects. 
They are also capable of performing a variety of 
simple manipulation actions such as throwing 
(58), sliding (59), poking (60), pivoting (67), and 
pushing (62). Difficulties arise when these ac- 
tions must be performed in cluttered environ- 
ments or require contact-rich interactions (e.g., 
when an object of interest is placed close to or 
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Fig. 6. Remaining challenges for robot manipulation. Dexterous movement of objects within the hand (left), manipulation of deformable objects (e.g., 
ation of objects in collaboration with humans (right) present ongoing difficulties. [Photos: Learning 


fruits and vegetables) (65) (middle), and manipu 
Algorithms and Systems Laboratory, EPFL] 


is covered by other objects or is located in a 
confined space such as a shelving unit). It is ne- 
cessary to plan a feasible path and generate a set 
of intermediate actions to ensure no damage to 
the hand or other objects. Today, it is also recog- 
nized that perception and control are tightly 
coupled, and the field of interactive perception 
(63) regards manipulation as a means to perceive 
and perception as a means to achieve better 
manipulation. 

Manipulation actions that generate changes 
on the object (cutting, crushing) remain partic- 
ularly difficult, as they require a model of the 
deformation and advanced perception to moni- 
tor the alterations (64). To facilitate adaptation 
to the changes induced, these actions also neces- 
sitate that the forces be applied by the hand (e.g., 
a reduction of friction when unscrewing a bottle 
cap, an increase in viscosity when digging into a 
melon) (Fig. 6) (65). Thus, modeling an object’s 
friction and viscosity properties is still an im- 
portant open problem. 

In-hand manipulations in which an object is 
moved while being held are also particularly 
complicated. Examples include twirling a pen 
across fingers or preparing a key to be inserted 
into a keyhole. These actions comprise an exten- 
sive combination of (re)grasping movements 
and sliding and rotating maneuvers, as well as 
interactions between two arms and hands, in some 
cases. When discussing such advanced interac- 
tions with objects in robotics, we commonly talk 
about intrinsic and extrinsic dexterity. The former 
denotes the ability of the hand to manipulate 
objects using its available degrees of freedom. 
Hands with high intrinsic dexterity often mimic 
the structure of the human hand (66). Alterna- 
tively, the hand can be simpler, and the end- 
effector is designed specifically for a particular 
task (67, 68). Extrinsic dexterity is the ability to 
compensate for the lack of degrees of freedom by 
using external support, such as friction, gravity, 
and contact surfaces (69). This functionality also 
enables dexterous manipulation with simple pa- 
rallel grippers. 

One of the largely underdeveloped areas in 
robotics is dual-arm or bimanual (70) manip- 
ulation, as well as the use of the second hand 
and/or arm to support both intrinsic and ex- 
trinsic dexterity (77). Some recent work in this 
area (72) proposes integration of object repre- 
sentation, definition of simple movement pri- 
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mitives, and planning to model the problem in 
an efficient manner. This area will gradually 
produce more and more contributions in the 
future, given that most of today’s humanoid ro- 
bots have bimanual capabilities. Furthermore, 
manipulation does not stop at simply controlling 
the hand; it requires control of the arm, torso, 
and ultimately the entire body (73). The chal- 
lenges listed above only increase in scale when 
one wishes to enable a full humanoid robot to 
manipulate objects while maintaining its balance 
(74) (Fig. 7). Finally, control of more-complex 
manipulation skills that require reasoning, such 
as using an object to retrieve another object, are 
still in infancy. 


Learning for manipulation 


Human dexterity is a skill acquired during child- 
hood and further refined throughout life, in 
activities such as playing a musical instrument 
or practicing a craft. Similarly, robot dexterity 
cannot be achieved in the confines of our labor- 
atories. To be able to manipulate the vast array of 
objects that exist throughout the world, robots 
must be able to learn continuously, adapt their 
perception, and control unfamiliar objects. 
Learning also addresses some challenges linked 
to the lack of accurate models of objects and 
contact dynamics and the increasing complexity 
of control for robots with large degrees of free- 
dom. Hence, many of the present approaches to 
dexterous manipulation rely on learning meth- 
odologies in place of control-theoretic approaches. 
For instance, learning can be used to embed rep- 
resentations of stable or suitable grasps (75-78), 
which can then be applied to verify stability and 
generate regrasping motions at run time or to 
catch a fast-moving object (79). Learning is par- 
ticularly suitable for embedding the dynamic 
nature of grasping and manipulation, as well 
as for modeling manipulation of complex non- 
rigid objects. Learning has been used to model 
contacts (80) and is also beneficial for reducing 
control dimensions by determining latent space, 
as required in bimanual dynamics (65). 
Nevertheless, solving all problems by solely 
relying on learning is not a viable solution and 
has certain limitations. First, learning requires 
data for training, and a common approach is to 
generate data from trial-and-error experiments. 
However, this process is tedious and may dam- 
age the robot. A growing trend in providing 
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Fig. 7. Whole-body manipulation. Manipulation 
of a heavy object by a humanoid robot requires 
coordination of arms and body to maintain 
balance. [Reproduced from (74)] 


training data is to test the algorithms in sim- 
ulation first and then refine the learning on a 
real platform; e.g., for learning dexterous in-hand 
manipulation (87-83). Training in simulation de- 
pends on having an accurate simulator of the 
tasks. Alternatively, robots may learn from image 
data and videos available on the internet (84) or 
from demonstration by a live expert, usually a 
human. Yet it may not always be possible to find 
an expert, especially when the tasks are danger- 
ous or require extreme precision. Hence, although 
learning is important, it cannot be the answer to 
every problem in robotics. 
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Manipulating objects in interaction 

and collaboration with humans: Reality 
and challenges 

Human-robot collaboration in manufacturing 
setups has been deemed crucial for the industry 
(85, 86). Although historically, humans were pro- 
hibited from entering the robot’s environment 
(ISO 10218; ANSI/RIA R15.06-1999), it is now 
accepted that robots can work in close proximity 
to and in collaboration with humans. However, 
potentially dangerous scenarios may still occur 
and need to be addressed. Currently, human- 
robot collaboration is allowed through the use 
of manipulators that remain fairly light and are 
endowed with internal force sensors for detect- 
ing unexpected contact or collision with humans. 
For applications that require maneuvering heavy 
weight, robots capable of managing the weight 
may be coupled with an external vision system 
for monitoring human presence. Yet challenges 
remain for accurately detecting human pres- 
ence. At present, the best solution is to com- 
bine sensing of proximity and force with external 
vision-based monitoring. Nonetheless, the 100% 
fence-based safety paradigm is gone, and indus- 
trial standards now target risk minimization and 
mitigation (ISO/TS 15066). 

In addition to facing a world in which objects 
move and change, robots are now expected to 
manipulate these objects in collaboration with 
humans. Interactive and collaborative manipu- 
lation adds a new dimension to robot manipu- 
lation but presents a wealth of challenges (87). 
For example, when a robot is tasked with hand- 
ing an object to a person or carrying a large 
object jointly with someone, the robot must 
grasp and move the object carefully and with 
foresight, so that the robot can infer where the 
human will move and the human does not get 
injured. As simple as it may seem, the act of a 
robot handing an object to a human entails 
several complex questions, which have in turn 
inspired studies on how to enable a robot to 
properly perform this task (88-92). These ques- 
tions range from how to present an object for 
optimal human grasping to others related to 
social factors, such as the role of gaze, social 
cues, and awareness of user state. There is no 
agreement on what factors are most important 
in determining how handovers are carried out 
between two humans, let alone between a robot 
and a human. Although most research has foc- 
used on a robot handing objects to humans, 
there have also been studies on robots taking 
objects from humans (93-95). Additionally, sev- 
eral efforts have been aimed at enabling a robot 
to manipulate objects jointly with humans, and 
the act of carrying objects jointly with humans 
has been demonstrated using both humanoid ro- 
bots (96-99) and mobile manipulators (100-103). 
Notable recent efforts have explored human- 
robot joint manipulation of deformable mate- 
rials (104), helping humans to dress (105), and 
assistive support (106). 

Hence, for robots to work seamlessly with 
humans, researchers are striving to equip robots 
with the tools for better perception of humans 
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and more adaptive control modes. In addition, 
roboticists seek guarantees in terms of machine 
performance and the use of common evaluation 
scenarios and benchmarks. 


Outlook 


Since the 1960s, substantial progress has been 
achieved in several areas of robotic manipu- 
lation. We have established the basic theory 
of evaluating stability of a grasp, control algo- 
rithms that can adapt to unpredicted situations, 
and changing dynamics when the appropriate 
sensor feedback is available to perform state es- 
timation. Lately, the field has also seen advances 
in data-driven approaches in which even dexter- 
ous in-hand manipulation can be accomplished, 
but only for very specific problems and in highly 
tailored environments. Achievement of robust, 
flexible, and adaptive grasping and manipulation 
of completely unknown objects in media such as 
water and oil (not solely in air) is expected to 
result in a major manufacturing revolution, which 
will affect most of the work that relies on fine 
manipulation and high dexterity. However, sys- 
tematic development is ongoing toward several 
technologies vital for meeting and exceeding hu- 
man dexterity and fine manipulation capabilities. 

First, there is still a need for basic theoretical 
development. We must seek to understand and 
model soft point contact and provide stability 
rules for both point contacts and surface con- 
tacts. We also need to develop a better method 
for modeling objects whose states change mark- 
edly after manipulation (e.g., a cucumber after 
being sliced, an onion after being chopped). A 
thorough description of manipulation and task 
goals will be required for planning and gener- 
ating appropriate intermediate grasping and 
manipulation actions. This emphasis on theory 
and planning is also relevant for data-driven 
approaches, as we need better tools for sim- 
ulating soft bodies and generating relevant 
scenarios and examples that include force and 
torque information. 

In addition to the aforementioned modeling 
and software aspects, we also seek to achieve 
substantial progress in hardware development 
and design. One area of particular relevance is 
that of robot sensing. It will be important to 
develop skinlike sensors that are well integ- 
rated with hand design but do not require ex- 
cessive cabling or add substantial weight. This 
sensing functionality should facilitate force 
and torque measurements, determining shear 
forces to detect and counteract slippage. To 
achieve dexterous in-hand manipulation, we also 
need actuated hands that can be controlled with 
high frequencies. Such hands must function in 
different media (air, water, and oil) without 
being damaged or needing to be covered by 
special gloves. Overall, we need hands that are 
light, cheap, robust, and easily integrated with 
any type of robotic arm. 

Finally, an important industrial challenge will 
be to bring robots in closer proximity to humans 
and enable safe physical interaction and collab- 
oration. Fences that used to separate humans 
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from robots will disappear gradually. Robots 
will thus need to be engaged in collaborative 
tasks to jointly manipulate objects with humans 
while adapting to unexpected human behavior. 
Equipping robots with advanced physical inter- 
action capabilities to achieve safe and smooth 
synchronization of motion between machine and 
human is still a major hurdle. This objective will 
require advances in detailed tracking of human 
fine body movement, as well as a better under- 
standing of how humans collaborate and achieve 
joint goals through planning and direct physical 
interaction. Furthermore, there is a demand 
for robots that are safe by design, putting focus 
toward soft and lightweight structures as well 
as control and planning methodologies based 
on multisensory feedback. Human ways of acting 
will continue to serve as inspiration for future 
robot systems, and robots will serve as a tool for 
better understanding humans. 
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RESOLVING 


MINANTS 


By Laura M. Zahn 


Ruminants are one of the most diverse groups of mam- 
mals. All members have a specialized digestive organ 
known as the rumen, as well as other organs that per- 
mit them to subsist on plants. These adaptations allow 
ruminants to digest foliage more efficiently than other 
mammalian species and may underlie the success of this 
clade. Additionally, most ruminant species 
have impressive bony outgrowths, collec- 
tively referred to as headgear, that may aid 
in mate selection. These varied ornaments, 
some of which are exclusive to males, in- 
clude the horns carried throughout the lives 
of sheep, antelope, and cattle, as well as the 


An adult female reindeer 
(Rangifer tarandus) 

in Cairngorms, Scotland, 

UK. Reindeer are ruminant 
mammals that exhibit 
species-specific adaptations 
to their harsh 

Arctic distribution. 


annually regenerating deer antlers, one of the fastest- 
growing tissues of any mammal. Here, Science presents 
three studies that use genomic analyses to resolve the 
evolutionary relationships among the major ruminant 
lineages, identify the genes involved in the evolution 
of headgear, and examine how reindeer adapted to the 
long days and nights of the Arctic region. 
These studies allow us to better appreciate 
the familiar domestic cows, sheep, and deer 
and their wild relatives and to peer into 
the adaptations underlying the antelopes, 
giraffes, bison, and pronghorn that occupy 
the world’s grasslands. 
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Large-scale ruminant genome 
sequencing provides insights into 
their evolution and distinct traits 
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INTRODUCTION: The ruminants are one of 
the most successful mammalian lineages, exhib- 
iting extensive morphological and ecological 
diversity and containing several key livestock 
species, such as cattle, buffalo, yak, sheep, and 
goat. Ruminants have evolved several distinct 
characteristics such as a multichambered stom- 
ach, cranial appendages (headgear), special- 
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Phylogeny and trait evolution of ruminants. The phylogenic tree of ruminants is presented with 


Headgear 


ized dentition, a highly cursorial locomotion, 
and a wide range of body size variations. De- 
spite their biological prominence and value 
to human societies, the evolutionary history 
of ruminants has not been fully resolved, 
and the molecular mechanisms underlying 
their particular characteristics remains largely 
unknown. 


—_> 
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, ‘a 


Loss of headgear 
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the species within same families and subfamilies collapsed. The ruminants have many textbook 


examples of distinct traits. The four-chambered stomach with omasum chamber is a key innovation 
evolved in pecoran ruminants. Headgear keratinous sheath only appear in Bovidae and Antilocapridae 
lineages. Many ruminants have evolved high-crowned or hypsocont teethes. The Antilocapridae and 


two bovid lineages are among the mammals with highest cursorial locomotion ability. 
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} 
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Hippotraginae 
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10 0 Million years ago 


RATIONALE: We seek to resolve the controver- 
sies in the ruminant phylogeny and reveal the 
genetic basis underpinning the evolutionary 
innovations in ruminants. Here, we report the 
newly sequenced genomes of 44 ruminant spe- 
cies, covering about half the genera and all six 
extant Ruminantia families. We included seven 
published ruminant genomes (five bovids and 
two cervids) to reconstruct the phylogenetic tree 
by using improved time calibrations. We also 
reconstructed the Pleistocene demographic his- 
tories of these ruminant species using whole- 

genome heterozygosity in- 
formation. Together with 
Read the full article  Wamscriptomic data of 516 
at http://dx.doi. samples from 68 tissues 
org/10.1126/ of four species, we con- 
science.aav6202 ducted comparative ge- 

nomic analyses to reveal 
candidate genes and regulatory elements that 
might have contributed to the evolution of the 
distinct ruminant characteristics. 


RESULTS: Using whole-genome orthologous 
sequences obtained from 51 ruminants, we have 
produced a new well-supported ruminant phy- 
logenetic tree. The new tree resolves previous 
controversies over the deep branches of rumi- 
nant families, as well as the highly radiated 
Bovidae family. We estimated the emergence 
of crown Ruminantia to the late Oligocene 
(39.1 million to 32.3 million years ago) and that 
of Pecora to the Neocene (23.3 million 
to 20.8 million years ago). Investiga- 
tions of demographic history revealed 
massive population decline events that 
occurred in most ruminant species, 
starting from ~100,000 to 50,000 years 
ago, which was temporally and spatially 
concurrent with the increased human 
activities on different continents during 
this period. We further identified many 
genomic changes that associate with 
important evolutionary innovations, 
such as the multichambered stomach, 
headgear, body size variation, cursorial 
locomotion, and dentition. 


CONCLUSION: Our results demon- 
strate the power of using comparative 
phylogenomic approaches in resolving 
the deep branches of phylogeny that 
result from rapid radiations. The data 
and results presented in this study pro- 
vide valuable resources and insights 
into the evolution of ruminant and 
mammalian biology. 
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The ruminants are one of the most successful mammalian lineages, exhibiting morphological 
and habitat diversity and containing several key livestock species. To better understand 
their evolution, we generated and analyzed de novo assembled genomes of 44 ruminant 
species, representing all six Ruminantia families. We used these genomes to create a time- 
calibrated phylogeny to resolve topological controversies, overcoming the challenges 

of incomplete lineage sorting. Population dynamic analyses show that population declines 
commenced between 100,000 and 50,000 years ago, which is concomitant with expansion 
in human populations. We also reveal genes and regulatory elements that possibly contribute 
to the evolution of the digestive system, cranial appendages, immune system, metabolism, 
body size, cursorial locomotion, and dentition of the ruminants. 


uminantia is an important group of ter- 
restrial herbivores, including at least 200 
extant species (7) spanning six families: 
Tragulidae, Antilocapridae, Giraffidae, 
Moschidae, Cervidae, and Bovidae. The 
most species-rich of these families is Bovidae, 
which encompasses at least 143 species (2, 3), 
including important livestock animals (such 
as cattle, buffalo, yak, sheep, and goat) (4, 5). 
Ruminants possess several distinct and charac- 
teristic anatomical hallmarks, such as a multi- 
chambered stomach and cranial appendages 


(headgear). The acquisition of the rumen in the 
ruminants and of the omasum in pecorans (all 
ruminants except the Tragulidae) allow these 
animals to use plant material with a higher ef- 
ficiency than other herbivorous mammals, such 
as equids (6-8). This effect is believed to be as- 
sociated with the evolutionary success of these 
animals in terms of diversity, abundance, and 
geographic range (9). Ruminants have also evolved 
extreme morphological diversity, ranging in body 
weight from <2 kg to >1200 kg (0, 12), and a wide 
variety of distinct behavioral and physiological 


traits. Ultimately, these adaptations in ruminants 
likely explain the remarkable abundance of these 
animals among domestic animals. 

Despite their biological prominence and value 
to human civilization, much remains to be learned 
about the ruminants. For example, the phylogeny 
of ruminants is far from resolved, and inconsist- 
encies remain even at the family level (12-15). 
Moreover, the genetic basis underlying many 
of their characteristic traits remains unknown. 
To fill in our gaps in knowledge, we performed 
de novo assembly of the genomes of 44: ruminant 
species, representing 36 genera that span all 
six families. In combination with five previously 
published bovid genomes, two published cervid 
genomes (16-22), and recently updated fossil in- 
formation (75), we constructed a time-calibrated 
phylogenetic tree of the group, analyzed species 
population histories, and investigated the genomic 
evolution of these species. Our results not only 
provide data for understanding the origin and 
evolution of this important mammalian group 
and their particular traits but also have impli- 
cations for placing ruminant livestock genomic 
resources into an evolutionary context and for 
conserving ruminant biodiversity. 


Results 
Genome sequencing, assembly, 
and annotation 


We used Illumina sequencing technology (23) 
to generate more than 40 trillion base pairs 
(Tbp) of raw data and then de novo assembled 
genomes for 44 ruminant species (Table 1 and 
tables S1 and 82). Eleven of the species are listed 
as “vulnerable” or worse on the International 
Union for Conservation of Nature (IUCN) Red 
List (table S1). Forty of the genomes were as- 
sembled into large scaffolds (Table 1 and tables 
83 and S4) with a series of mate-paired large- 
insert libraries, whereas four genomes—common 
eland (Taurotragus oryx), mountain nyala 
(Tragelaphus buxtoni), bongo (Tragelaphus 
eurycerus), and oribi (Owrebia ourebi)—were 
only assembled to the contig level becausse of 
degenerated samples but still qualified for most 
comparative genomic analyses (Table 1 and table 
S4). In addition, we improved the quality of the 
genome assembly of the black muntjac deer 
(Muntiacus crinifrons; contig N50 = 2.4 Mbp) 
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Fig. 1. Phylogeny of ruminants. (A) The maximum likelinood phylogenetic tree fromm whole-genome sequences of 51 ruminant species and 13 fossil 
calibrations. To compute the node supports, 200 bootstraps were used, and all nodes have 100% support. The origin and credit of the portraits 
of different species are listed in table S54. (B) Prevalent discordance among 10,000 random WGTs was observed across different families of ruminant. 
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using long reads generated from PacBio Single 
Molecule, Real-Time (SMRT) sequencing. This 
improved assembly of a Cervidae species facilitated 
synteny and other comparative analyses within the 
ruminants. Three genomes—reindeer (Rangifer 
tarandus), milu deer (Elaphurus davidianus), 
and Marco Polo sheep (Ovis ammon)—were re- 
leased in data notes before this study (24-26). 

To evaluate the quality of these genome assem- 
blies (fig. S1 and tables S5 to S7), we performed 
BUSCO and synteny analyses and observed high 
BUSCO (27) scores (average 87%, from 75 to 93%) 
(table S6) and synteny continuity (table $7), showing 
that the majority of the assemblies were of high 
quality for downstream comparative analyses. 
The assembled genomes ranged in size from 2.52 
to 3.25 Gbp, which was mostly consistent with the 
cytological C values (28) and k-mer-based esti- 
mations (table S5), indicating relatively complete 
genome coverages for all species. 

After masking repeats (table S8), we used both 
de novo and homology-based gene prediction to 
annotate the genomes. The protein sequences 
of human (Ensembl 87 release), cattle (Ensembl 
87 release), and sheep (Ensembl 87 release) were 
used as templates for homology-based gene pre- 
diction. The final annotated gene numbers range 
from 19,304 to 25,753 (table S9) among different 
species, with variations driven primarily by the 
quality of the assembled genomes. The gene length 
and exon length distributions were similar to those 
of other mammals (fig. S2 and table S9). We also 
identified 221,166 ruminant-specific conserved 
nonexonic elements (RSCNEs) by comparing all 
the ruminant genomes to 12 mammalian outgroup 


species (fig. S3 and table S10). These RSCNEs 
occupy ~0.61% of the genomes (~16.5 Mbp) (table 
S11) (23). A user-friendly, publicly available ge- 
nome browser database was established (http:// 
animal. nwsuaf.edu.cn/code/index.php/Ruminantia) 
for visualization of the genomic and transcrip- 
tomic data presented in this study. 


Resolving the ruminant phylogeny 


The lack of a fully resolved phylogeny for rumi- 
nants (72-15) still hinders an understanding of 
the evolution of ruminant diverse phenotypes. 
In particular, the phylogenetic positions of the 
Antilocapridae and Moschidae families have been 
strongly debated, and so have the relationships 
among subfamilies within the diverse Bovidae 
family [(72), review]. Furthermore, although “dwarf 
antelopes” have previously been grouped in the 
tribe Neotragini, recent molecular studies sug- 
gest that these animals are polyphyletic (14, 15). 
These controversies are probably attributable to 
convergent evolution (challenging morphology- 
based approaches) and incomplete lineage sort- 
ing (ILS) in conjunction with the short internal 
branches in the ruminant radiation (challenging 
genetic inference) (72). 

To resolve these phylogenetic challenges, we 
first estimated a whole-genome phylogenetic tree 
with ExaML under the GTR+GAMMA model (23) 
using the killer whale genome (29) as an out- 
group. In total, 373 Mbp of orthologous syntenic 
sequences were obtained from whole-genome 
alignment by using the goat as a reference genome 
(19), yielding a whole-genome tree with 100% 
bootstrap support for all nodes (Fig. 1A). We 


also performed phylogenetic analyses with other 
genome partitions, including 6406 orthologous 
protein-coding genes identified in the 51 rumi- 
nant species and the killer whale, the fourfold 
degenerate sites in these genes, conserved non- 
exonic elements (CNEs), and complete mitochon- 
drial genomes (mtDNA). With the exception of 
the mtDNA tree, the topologies of all other trees 
were identical with that of whole-genome tree 
(Fig. 1A and figs. S4 to S10). 

Although the phylogenetic tree constructed 
with concatenated nuclear genome sequences 
(nDNA tree) was highly supported, phylogenetic 
discordance was pervasive across genomic re- 
gions (Fig. 1B). To further assess genome-wide 
tree discordance, 10,000 random genomic 1-Kbp 
windows at least 50 Kbp distant from each other 
were extracted. These 1-Kbp windows had enough 
segregating sites to generate window-based gene 
trees (WGTs) of high resolution (fig. S11). Although 
all individual WGTs differed from the species-level 
nDNA tree topology, 21.3% of WGTs exhibited 
the same family-level topology as that of the con- 
catenated nDNA tree (table S12). This type of tree 
incongruence is usually caused by ILS, which has 
been widely observed in phylogenies of many ani- 
mal groups—such as African cichlids (30), birds 
(31), and great apes (32, 33)—and is known to be 
exacerbated by the effects of gene tree estima- 
tion error (34, 35). 

To ensure that the reconstructed phylogeny is 
robust in the presence of ILS, we performed seve- 
ral analyses with the coalescent-based phylogenetic 
method ASTRAL (36) using the 10,000 WGTs. 
We did not use entire genes as the unit of gene 


Table 1. Assembly statistics of 44 ruminant species. 


Species Common name 


Tragulus javanicus Lesser mouse-deer 


243,250 


Scaffold N50 Contig N50 


Species 


(bp) (bp) 


6286 Redunca redunca 


Common name 


Bohor reedbuck 


Scaffold N50 Contig N50 
(bp) (bp) 
438,845 17,874 


Antilocapra americana Pronghorn 


Muntiacus muntjak Indian muntjac 
Elaphurus davidianus Milu 
Muntiacus crinifrons 
G 


Moschus chrysogaster 


Black muntjac 


Forest musk deer 


Damaliscus lunatus Topi 
Pseudois nayaur Blue sheep 
Kobus ellipsiprymnus Defassa waterbuck 
P 


Connochaetes taurinus Blue wildebeest 


1,398,591 


3,040,530 


1,172,125 


2,076,308 


782,102 


10,925 Nanger granti 


32,708 Aepyceros melampus 


NA 1,458,913 Oreotragus oreotragus 


Tragelaphus imberbis 


Raphicerus campestris 


Alcelaphus buselaphus 


Neotragus moschatus 


African buffalo 


Grant's gazelle 520,131 6041 


Impala 343,699 Bil SV) 


Klipspringer 340,335 13,019 


Lesser kudu 


Steenbok 474,033 


Hartebeest 11,258 


Suni 957,022 


Tragelaphus scriptus Bushbuck 
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3,511,341 46,638 Tragelaphus buxtoni 
365,462 SSL ls Tragelaphus euryceros 
890,554 9965 Ourebia ourebi 


Mountain nyala no scaffold 1309 
Bongo no scaffold 1980 
Oribi no scaffold 1259 
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tree construction because genes can span over 
several recombination blocks in the genome (37). 
The topology of the obtained ASTRAL coalescent 
tree (fig. S12) is identical to the nDNA tree (Fig. 1A). 
We also used another coalescent-based phylo- 
genetic method, MP-EST (38), to carry out ad- 
ditional phylogenetic analyses using both the 
10,000 windows as well as the 6406 orthologous 
genes and obtained an identical topology as those 
of the ASTRAL and nDNA trees (fig. S13), further 
supporting the robustness of the inferred rumi- 
nant phylogeny. To minimize negative impacts of 
low tree resolution in the WGTs, we contracted 
low-supported branches [below 3, 10, and 20%, 
bootstrap support, as suggested in (36)] and still 
observed the same topology as that of the nDNA 
tree in all cases (fig. S14). 

We further tested whether gene flow could 
have contributed to the observed topological 
discordances. The results from the ABBA-BABA 
test (39), the Dgor,, software (extend from 4 taxa 


to 5 taxa) (40), and admixturegraph (4/7) con- 
sistently suggested some possible ancient gene 
flows among Giraffidae, Cervidae, Bovidae, and 
Moschidae (fig. S15 and table S13). The stron- 
gest gene flow signal between Giraffidae and 
Cervidae-Bovidae-Moschidae coincides with an 
overrepresentation of this topology in the WGTs 
relative to the expectation of ILS, which requires 
that for a set of four species trees, alternative two 
phylogenetic topologies other than the species 
tree have equal proportion (fig. S16). By contrast, 
PhyloNet (42) and PhyloNetworks (43) were sensi- 
tive to model and parameter choices and did not 
generate consistent results (figs. S17 and S18). It 
is plausible that the high parameter space of the 
phylogenetic model precluded these methods from 
performing complete explorations of the parameter 
space in our large-time-scale phylogenomic data. 

Our new ruminant tree supports a sister-group 
relationship for Antilocapridae and Giraffidae, 
representing the oldest branch among the ex- 


tant pecoran families. This is different from the 
mtDNA-based phylogenies, which placed Anti- 
locapridae as an outgroup to all other pecorans 
(figs. S9 and S10) (12, 14, 15). The sister relation- 
ship of these two groups has been proposed be- 
fore (44, 45) but now received 100% bootstrap 
support by the nDNA tree and a local posterior 
probability (46) of 1 in the ASTRAL tree and had 
23% higher frequency of WGTs than those of the 
alternative topologies (fig. S16). Another con- 
tentious issue is on the placement of Moschidae, 
which was previously placed at the base of Pecora 
(47-49) or as a sister group to Bovidae (50) from 
morphological data. In some mtDNA studies, 
Moschidae was proposed as a sister group of 
Cervidae (57) or a sister taxon to Bovidae (/4, 52, 53). 
Our nDNA tree confirmed that Moschidae is a 
sister group of Bovidae, and the frequency of 
WGTs further support this conclusion (fig. S16). 

Our results also resolved some controver- 
sial subfamily relations in Bovidae, such as the 


A OF Fig. 2. Population size history of ruminants. (A) The normal- 
0.6 ized effective population sizes (N.) of Ruminantia shows a 
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placement of the Reduncinae (72). With the 
whole-genome tree, Reduncinae were confirmed 
to be a sister group to the ancestral lineage of 
Caprinae, Alcelaphinae, and Hippotraginae. This 
topology is also supported by the ASTRAL tree 
and has a slightly higher WGT frequency than 
those of alternatives (fig. S19). We further con- 
firmed the previously ambiguous sister-group re- 
lationship between impala (Aepyceros melampus) 
and suni (Neotragus moschatus) (54, 55) and found 
no phylogenetic support for the “waste-bucket” 
Neotragini tribe (56). These findings mostly agree 
with the topology from Decker et al. (57), which 
used single-nucleotide polymorphism (SNP) chip 
data but lacked samples from Moschidae and 
Tragulidae. Furthermore, our results provided 
higher bootstrap supports for the deep nodes 
and also refined some species positions within 
Bovidae—the position of bushbuck (Tragelaphus 
scriptus) and mountain nyala (Tragelaphus 
buxtoni). 

Using fossil calibrations (tables S14 and S15) 
(15), we estimated the emergence of crown Ru- 
minantia at 39.1 million to 32.3 million years ago 
(late Oligocene), and the emergence of Pecora at 
23.3 million to 20.8 million years ago (Neocene) 
(Fig. 1A and figs. S20 to S27). The evolutionary 
rate in the ancestral ruminant lineage was ~1.5 x 
10°, which was significantly higher than that in 
other mammals (Student’s ¢ test, P < 0.01) (fig. 
$28). Among the ruminant families, Tragulidae 
had the highest evolutionary rate, and Giraffidae 
had the lowest evolutionary rate (fig. S28). We 
found a significant negative correlation between 
evolutionary rate and log body size (fig. S29). 


Pleistocene population dynamics 
in ruminants 


We used our whole-genome dataset to investigate 
the demographic histories of different ruminant 
species using the pairwise sequentially Markovian 
coalescent (PSMC) method (58), which can infer 
changes in the effective population size (V.) over 
the Pleistocene (figs. S30 and S31 and table S16). 
The analyses produced a species-specific demo- 
graphic pattern with no clear grouping of patterns 
according to their habitat types or feeding types 
(fig. S32). This might imply that the ruminant 
species responded differentially to biotic and 
abiotic pressures associated with their differ- 
ent ecological niches (59). However, we found 
population declines for many species (25 out of 
the 40 species with scaffolded genome assem- 
blies) starting from ~100,000 to 50,000 years ago 
(Fig. 2A), which suggests that late Pleistocene 
large mammal declines were much more severe 
than previously suspected, involving major de- 
clines in the populations of most species along 
with the mass extinction of large mammals at 
this time (60). We speculate that this community- 
wide decline might be at least partially attrib- 
utable to human activities. This is supported by 
ruminant population declines coinciding with 
increasing human effective population size after 
the dispersal out of Africa during this period 
(Fig. 2A) (61). Intriguingly, the onset of ruminant 
decline coincided with the sequential population 
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expansion of humans on different continents 
in this period (Fig. 2B), which would not be 
expected if the decline signal was a structural 
artefact. By contrast, ruminant population size 
dynamics showed no apparent correlation with 
the climatic changes dominated by serial glaci- 
ations during the Pleistocene (figs. S30 and S31). 
Taken together, these results highlight a possi- 
ble role for humans in the massive decline of 
mammalian species in the late Pleistocene (60). 


Structure and evolution of 
ruminant genomes 
Synteny 


We explored the general architecture of ruminant 
genomes through comparison with other high- 
quality mammalian genomes (23). Specifically, 
we compared the syntenic relationship between 
the goat (the most well-assembled ruminant 
genome) (79) and human (Primates) (62), dog 
(Carnivora) (63), horse (Perissodactyla) (64), pig 
(Suina) (65), camel (Tylopoda) (66), killer whale 
(Cetacea) (29), and black muntjac (Cervidae of 
Ruminantia). The high-quality assembly of the 
black muntjac (Table 1) by use of PacBio reads 
allowed us to assess the within-ruminant synteny 
by including both a Cervidae representative as 
well as the high-quality Bovidae representative 
(goat). We found that Primates and Perissodactyla 
have more prevalent genome rearrangements 
(>200 rearrangements per million base pairs) 
compared with that of Ruminantia (Fig. 3A and 
table S17). Within the Cetartiodactyla, pig (2n = 18, 
Suina) and camel (2n = 74, Tylopoda) experienced 
more genome rearrangements (~120 rearrange- 
ments per million base pairs) than the killer whale 
(2n = 44, Cetacea). Thus, the greater level of con- 
served synteny between Cetacea and Ruminantia 
(68 rearrangements per million base pairs) is con- 
sistent with their sister relationship within the 
Cetartiodactyla. Few rearrangements (19 rear- 
rangements per million base pairs) were observed 
between the black muntjac (2n = 8/9) and the 
goat (2n = 58), suggesting that synteny has been 
conserved among different lineages of Ruminantia, 
despite large variation in chromosome numbers 
(table S18). 


Genome size 


Our ruminant genome assembly sizes ranged 
from 2.52 Gbp [oribi (Owrebia ourebi)] to 3.25 Gbp 
[klipspringer (Oreotragus oreotragus)] (table S5). 
The average assembled genome size of rumi- 
nants was (2.7 Gbp), which is larger than Car- 
nivora (~2.3 Gbp), Perissodactyla (~2.4 Gbp), 
Suina (~2.5 Gbp), Tylopoda (~2.0 Gbp), and 
Cetacea (~2.4 Gbp) and smaller than Primates 
(~3.0 Gbp) (Fig. 3B). Analyses (23) confirm that 
transposable element (TE) content is the major 
cause of genome size variation (Fig. 3, B and C, 
and table S19). 

When comparing the goat genome with the 
human, horse, pig, and killer whale genomes, we 
also observed and validated (23) large insertions 
and deletions (over 50 kbp in length) in ruminants 
(table S20). The largest insertion from segmental 
duplication in the goat contains a cluster of PAG 


(pregnancy-associated glycoprotein) genes, with 
36 coding sequences and 32 pseudogenes (Fig. 3D). 
The main functions of PAGs are immune regu- 
lation and maintenance of pregnancy (67). We 
also found other insertions with important gene 
annotations in ruminants, including interferon 
and olfactory receptors (fig. S33 and table S21). 


Evolution of genes and gene families 
in Ruminants 


We obtained a high-confidence orthologous gene 
set for the full set of 51 ruminant species using 
camel, cat, dog, horse, human, minke whale, killer 
whale, and pig as outgroups (23). Using the re- 
solved phylogeny of Ruminantia, we identified 
rapidly evolving genes (REGs), positively selected 
genes (PSGs), and newly evolved genes (Fig. 4A 
and tables S22 to S25) (23). 

Functional enrichment analyses of the PSGs 
(tables S26 and S27), the REGs (table $28), and 
expanded gene families (table S29) in the ances- 
tral ruminant branch all exhibit enrichment in 
immune functions. As many as 20 PSGs and 12 
REGs are involved in the crossing of blood vessels 
by leukocytes, which respectively constitutes 17.9 
and 10.7% of this key pathway in active immuni- 
zation (Fig. 4B and tables S30 and S31) (68, 69). 
We also observed gene family expansions in the 
ruminant ancestor (table S32), including the in- 
terferon family (IFNs) (figs. S34 to S36), PAG fam- 
ily (fig. S34), and cathelicidin and serpin peptidase 
inhibitor families (figs. S34, $37, and S38), which 
are all involved in immune system pathways. In ad- 
dition, we identified a rumen-specifically expressed 
newly evolved gene, ENSBTAG00000038127 
(23), which contains an immunoglobulin V-set 
domain, usually involved in mimicking the anti- 
body variable domain of several diverse protein 
families (70). Although rapid evolution of immune 
genes was found in a wide range of animals, the 
number of PSGs from the leukocyte transendo- 
thelial pathway is highest in Ruminantia and 
could suggest a key role of this pathway in the 
evolution of ruminant immune system (table S33). 

In addition to immune system-related genes, 
we also observed a series of PSGs, REGs, and 
expanded gene families in ruminants involved 
in lipid metabolism, glycolysis, oxidative phos- 
phorylation, and amino acid metabolism (Fig. 4, 
Cand D, and table $34) (23). Ruminants have a 
distinct digestive system, in which the main 
source of energy comes from volatile fatty acids 
(VFAs) and the rate of glycolysis is low (71). The 
genomic changes associated with metabolism 
may reflect the adaptation of the digestive sys- 
tem in ruminants and may therefore have played 
important parts in the success of the ruminants. 


Genomic variations related to ruminant 
morphological characteristics 


We were able to leverage our phylogenetic tree 
and genomic data to conduct evolutionary ge- 
nomic analyses aimed at identifying genomic 
variations correlated with particular ruminant 
characteristics. Specifically, we investigated the 
evolution of the multichambered stomach, head- 
gear, body size, cursorial locomotion, and dentition. 
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Evolution of the multichambered stomach 
Whereas pecoran ruminants have a four- 
chambered stomach composed of the rumen, 
reticulum, omasum, and abomasum, the basal 
group of Ruminantia, the Tragulidae, lacks the 
omasum (72). 

Using a transcriptomic dataset from 516 sam- 
ples covering 50 tissues of sheep (table S35) (23), 
we found that the gene expression profiles of 
rumen, reticulum, and omasum are closest to that 
of esophagus (Fig. 5A), implying that the three 
stomach chambers might have originated from 
the esophagus, as suggested by Warner (73) and 


Xiang et al. (74). Regarding the distinct rumen 
organ, we found that several newly evolved genes 
played a role in its function. Among the 295 newly 
evolved genes identified in the ancestor of rumi- 
nants (Fig. 4A and fig. S39), seven were highly 
or specifically expressed in the rumen (fig. S40). 
Two genes, PRD-SPRRII and TCHHL2, are im- 
portant structural genes in the rumen (16). Three 
newly evolved serpin genes may have inhibitory 
functions of different proteases or through anti- 
inflammatory functions (75, 76), and two KRT6A 
genes are likely involved in the activation of 
follicular keratinocytes (77, 78). 


Although the abomasum is analogous to the 
true stomach in other mammals, it is hypothe- 
sized that the ruminant abomasum has evolved 
particular adaptations to digest the microbe-rich 
content from upstream chambers (72). The lyso- 
zyme c family, which degrades bacterial and mi- 
crobial cells as members enter the abomasum to 
extract nutrients (79), has expanded to 10 or more 
copies in ruminants, whereas other mammals 
have only one or a few copies of this gene fam- 
ily (79). Our comparative genomics analyses re- 
vealed that the duplication of lysozyme c genes 
began in the ancestors of Ruminantia, continued to 
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Fig. 3. Structural characteristics and evolution of ruminant genomes. 
(A) Goat (19) was used to represent Ruminantia-specific genomic 
rearrangements in comparisons with Primates (human), Perissodactyla 
(horse), Suina (pig), Tylopoda (camel), and Cetacea (Killer whale). 
Syntenic blocks are linked between genomes in a circos plot. The red 
number beside each circos quantifies the occurrences of rearrangement 
events per aligned megabase (Mb) sequence. The high-quality de novo 
assembly of the black muntjac was included for within-ruminant synteny 
inference. (B) The average genome sizes, TE sizes, and contents of different 
TE types of Primates (human, chimpanzee, gorilla, and orangutan), Carnivora 
(dog, cheetah, and polar bear), Perissodactyla (horse, przewalski's horse, 
and rhinoceros), Tylopoda (dromedary camel, bactrian camel, and alpaca), 
Suina (pig), and Cetacea (minke whale, killer whale, beluga whale, sperm 
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whale, and yangtze finless porpoise). Overall, the average genome size of 
Ruminantia is significantly (Student’s t test, P < 0.01) larger than those 

of Canivora, Perissodactyla, Tylopoda, Suina, and Cetacea. Tragulidae 

is marked with dashed lines because the genome assembly contains more 
gaps, which hindered the annotation of TEs. The proportions of LINE, 
SINE, LTR, and DNA transposons are presented in the stacked bar plot. LINE/ 
BovB and LINE/L1 are highlighted here to present their dynamic changes 
among ruminants. ***P < 0.01. (C) The average contents of different SINE 
types are plotted in the stacked bar plot across mammalian orders and 
suborders of Cetartiodactyla, with different colors. (D) A large fragment 
insertion of 3,396,232 bp is observed in the goat genome, which is also 
validated in other ruminant families, containing a cluster of PAG genes, 
specifically containing 36 coding genes and 32 pseudogenes in goat. 
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expand along the diversification of pecoran families, 
and eventually culminated with the highest copy 
numbers in Bovidae (Fig. 4D). This accumulated 
evolution of lysozyme c gene copy number in rumi- 
nants may be associated with an improved harvest- 
ing of nutrients from the rumen microbiome. 

As a newly evolved organ in the pecorans, the 
omasum is adjacent to the reticulum, whereas its 
expression profile is closest to that of the rumen. 
This expression overlap may be linked to the 
resemblance in structure and function of the 
rumen and omasum, which are both involved in 
the absorption of water, minerals, electrolytes, 
and VFAs, whereas the reticulum mainly has a 
different function as a filter for the fermenta- 
tion products from the rumen (6). Anatomically, 
the omasum is composed of the same stratified 
squamous epithelium of mucosal-layered tissue 
as the rumen and reticulum, which is different 
from the abomasum and intestines (6). To fur- 
ther reveal the genetic basis underlying the evo- 
lution of the omasum in pecorans, we identified 
75 genes that were specifically highly expressed 
in the omasum compared with other organs 
(fig. S41). Among these, one gene was newly 
evolved (LOCI01107119), and another (SCNNID) 
exhibited pecoran-specific amino acid changes 
relative to Tragulidae and killer whale genomes 
(fig. S42). LOC101107119 is annotated as a pros- 
taglandin F synthase 1-like gene and might have 
a similar function to that of the prostaglandin 
F synthase 1 (PGDF1), which is involved in ke- 
tone metabolism (80). SCNNID encodes the 6 
subunit of the epithelial sodium channel, which 
mediates Na* reabsorption and water absorption 
in the digestive tract (87). 

Transcriptional factors and regulatory ele- 
ments may have played important roles in the 
evolution of the omasum by changing the expres- 
sion patterns of their host genes to be recruited 
in omasum functions. We found four genes with 
pecoran-specific CNEs within their immediate 
upstream/downstream 10-kbp regions (S/M2, 
PAX9, KCNK5, and DENND2C) (table S36), of 
which PAX9 and SIM2 are important transcrip- 
tional factors. PAX9 regulates squamous cell dif- 
ferentiation in the esophageal epithelium (82). A 
CNE with two pecoran-specific mutations was 
found at the 5’ upstream of PAX9 in pecorans, 
which might play a role in regulating PAX9 ex- 
pression in the omasum. SZ/2 is highly expressed 
in the omasum but inactive in the rumen [frag- 
ments per kilobase of exon per million fragments 
mapped (FPKM) < 1], and it works as a suppres- 
sor of cell proliferation in the epithelium (83, 84). 
The differential expression patterns of S72 in 
the omasum and rumen is consistent with the 
epithelial cells in the omasum not being com- 
pletely renewed, as is the case in the rumen (85). 


Evolution of headgear 


The ruminant families exhibit spectacular vari- 
ation in headgear morphology (86). To reveal the 
genetic basis of headgear origin in ruminants, its 
secondary loss in two independent lineages and 
the biologically exceptional, rapid regeneration 
ability of cervid antlers, we performed large- 
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scale comparative transcriptomics and functional 
experiments in an accompanying paper (87). 
Substantial similarities in the transcriptome profiling 
of different headgear types (87) are consistent with 
asingle origin of headgear in the pecoran ancestor. 
This ancestral headgear subsequently diversified 
into horns, antlers, ossicones, and pronghorns in 
the different pecoran families and was lost inde- 
pendently in the lineages of Moschidae and 
Hydropotinae perhaps because of the convergent 
pseudogenization of the horn-development RXFP2 
gene (87). These complex patterns, along with the 
lack of a well-resolved phylogeny, have confounded 
a synthesis of headgear evolution. 

We further examined the 201 highly expressed 
genes in the headgear of both bovids and cervids 
(87) and identified 36 of these genes harboring 
pecoran-specific CNEs (table $36). Nine genes 
(ALX1, VCAN, COLIA1, SATB2, RUNX2, POSTN, 
SP7, TNC, and COL4A2) were classified in Gene 
Ontology (GO) categories related to bone devel- 
opment. RUNX2 is a key transcriptional factor 
in the regulation of bone development (88) and 
has four pecoran-specific CNEs in its intronic 
regions (fig. S43A and table S36), potentially 
causing high RUNX2 expression in the headgear 
sprouts. SP7 encodes an important regulatory 
factor of the biomineralization and formation 
of bones (89) and has specific mutations in the 
pecoran 3’ untranslated region, one of which was 
located in the binding region of the microRNA 
bta-mir-145 (fig. S43B). These genes were possi- 
bly rewired for the formation of bony headgears 
of ruminants, laying out a hypothesis that can be 
tested experimentally in the future. 

In addition, we explored the genetic basis of 
the keratinous sheath found convergently in 
Bovidae and Antilocapridae headgear (Fig. 5B). 
Transcriptomic data from horn sprouts of sheep 
and goat (87), both of which belong to Bovidae, 
identified seven highly expressed keratin genes: 
KRTI1, KRT2, KRT3, KRT5, KRT10, KRT14, and 
KRTS84 (fig. S44.A). Except for KRTI0 and KRTI/4, 
the above keratin genes encode Type II a-keratin 
proteins, suggesting an essential role for Type II 
o-keratin genes in the formation of the kerat- 
inous sheath of bovids. We further examined 
convergent amino acid substitutions between the 
pronghorn (Antilocapra americana) and bovids. 
Using the 12 mammalian species and other 
ruminant species as outgroups, we identified 
106 proteins (table S37) that contain at least two 
convergent amino acid changes in these two 
ruminant families (23). Among these proteins, a 
horn-specific Type II o-keratin protein, KRT82, 
contained two convergently changed amino acid 
sites, Ala®’Thr and Ser*™ Ala, of which the Ala®’Thr 
is located in the keratin head domain (Fig. 5B and 
fig. S44B). This suggests that Type II a-keratin 
may be important in forming the keratinous 
sheath that evolved convergently in Bovidae and 
Antilocapridae. 


Evolution of body size in ruminants 


Ruminants, especially bovids, encompass a wide 
body size range that spans four orders of magni- 
tude, from as little as 2 kg to as high as 1200 kg 


(Fig. 5C) (10, 11). We retrieved 642 genes related 
to the development of body size and estimated 
the ratios between the nonsynonymous substi- 
tution rate (dN) and synonymous substitution 
rate (dS) of these genes on branches that exhibit 
substantially increased body sizes and on branches 
that exhibit decreased body sizes (fig. S45). Com- 
pared with the background, these genes showed 
significantly higher dN//dS ratios in branches with 
large-sized and small-sized species (Student’s 
t test, P < 0.01) but similar ratios in branches 
with medium-sized species (Fig. 5D). We ob- 
served six genes (CXCL13, RNF115, NPNT, KL, 
SLC9A3R1, and MSTN) that had significantly 
elevated dN/dS ratios along with the increasing 
of body size (Student’s t test, P < 0.01) (table S38), 
and SLC9A3R1 and MSTN have dN/dS values > 1, 
an indication of positive selection. SLC9A3RI1 
affects osteogenesis by mineralizing osteoblasts, 
and disrupting this gene resulted in reduced body 
weights in mice (90). MSTN is an important gene 
in the regulation of muscle cell growth and dif- 
ferentiation (97), and mutations in MSTN affect 
the muscle mass of goat (92), sheep (93), and 
cattle (94, 95) as well as other mammals (96, 97). 
These results indicate that SLCQA3RI and MSTN 
might be targets of natural selection favoring 
increased body sizes (fig. S45) by regulating the 
development of bone and muscle. In reduced 
body size branches, five genes (SBDS, BMP3, 
LRRN3, NFATC3, and SMARCALI) had signifi- 
cantly elevated dN/dS ratios (table S38), and the 
dN/dS ratio of SBDS was larger than 1. SBDS is 
an important gene in cell proliferation and is the 
causal gene of Shwachman-Diamond syndrome 
in humans, which is characterized by skeletal 
abnormalities and short stature (98). BMP3 is a 
well-known gene involved in the regulation of 
osteogenesis in mammals (99). The genes men- 
tioned above may therefore explain the body size 
variation in ruminants and may be relevant to 
livestock breeding application. 

As the tallest terrestrial animal, giraffes have 
a distinct stature and body morphology, which 
likely are adaptions to their savanna habitat (100). 
Among the 366 genes related to bone develop- 
ment in the KEGG annotation, 115 genes had 
giraffe-specific mutations (table S39), including 
genes in the transforming growth factor-B (TGF- B), 
Hedgehog, Notch, Wnt, and FGF signaling path- 
ways. Eleven genes with more than four non- 
synonymous mutations may be related to the 
extreme elongation of body structures in giraffe 
because they are part of bone development path- 
ways: TGF-B (CHRD and LTBPI), Wnt (APC, APC2, 
and CREBBP), Notch (NOTCH1, NOTCH3, and 
NOTCH4), Hedgehog (GL/2 and GLIJ3), and FGF 
(FGFRLI) (Fig. 5C). Three such genes (FGFRLI, 
NOTCH4, and CREBBP) had been identified in 
a previous comparative study by using a low- 
coverage genome assembly of giraffe (Giraffa 
camelopardalis) (table S39) (100). 


Adaptations to cursorial locomotion 


Although early ruminants were probably forest- 
dwelling (101), many lineages have adapted to 
open habitats by adopting more cursorial body 
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plans designed for rapid and/or sustained move- 
ment in an open habitat (102). These include 
adaptations in limb morphology (103) and possi- 
bly also physiology. The pronghorn (Antilocapra 
americana) and members of Antilopinae such 
as the springbok (Antidorcas marsupialis) are 
among the fastest-running animals on Earth 
(104). Both lineages exhibit strong selection in 
their mitochondrial electron transport chains, 
although the targeted genes of selection differ 


Newly evolved genes 
Expanded gene family 
Contracted gene family 
Rapidly evolving genes 
Positively selected genes 


between these species. In Antilopinae, two genes, 
SUOX (sulfite oxidase) and NLN (neurolysin), 
were under positive selection (x? test, P < 0.05) 
(table S24). In the pronghorn, two mitochon- 
drial electron transport chain enzyme encoding 
genes [COXSA (cytochrome c oxidase subunit 
5A) and PPOX (protoporphyrinogen oxidase)] 
were subjected to positive selection (y” test, 
P value < 0.05) (table $24). In addition, four 
proteins in the mitochondrial electron transport 


chain—NDUFAIO (reduced nicotinamide ade- 
nine dinucleotide dehydrogenase 1 a subcomplex 
subunit 10), SDHB (succinate dehydrogenase 
iron-sulfur subunit), UQCRC2 (cytochrome b-cl 
complex subunit 2), and ATP5B (adenosine 5’- 
triphosphate synthase subunit 8), functioning in 
oxidative phosphorylation—exhibited convergent 
amino acid changes (Fig. 5E and tables $40 and 
$41). Furthermore, all species of the bovid tribe 
Alcelaphini, which are adapted to open and seasonal 
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Fig. 4. Newly evolved genes, expanded and contracted gene families, 
REGs, and PSGs in ruminants. (A) Positively selected genes (PSGs), 
rapidly evolving genes (REGs), expanded genes, and newly evolved genes 
are shown along the phylogenetic tree. PSGs and REGs are identified 
with PAML. Newly evolved genes are identified based on the synteny 
relationships, with goat used as a reference. Expanded gene families are 
identified for the pecoran families. Tragulidae are excluded because of the 
relatively low quality of the assembled tragulid genome. (B) Diagram of 
the process of leukocytes crossing blood vessels, highlighting PSGs (orange) 
and REGs (blue). The orange rectangles indicate PSGs, and the blue 
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family in (A), and we chose the best assembled species genome sequences 
to draw the region. The slashes on each line indicate the ends of scaffolds 
in the assembled genomes. 
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grasslands and have cursorial locomotion, shared 
three specific mutations (Tyr"Phe, Gin*““Arg, and 
Glu®’Ala) in the ACE (angiotensin I converting 
enzyme) gene (fig. S46) and one specific muta- 
tion (Glu®’Asp) in the EPO (erythropoietin) gene 


(fig. S47), both of which are important for endurance 
(105, 106). These observations imply that shared 
or distinct molecular pathways might have 
played roles in convergent phenotypic evolu- 
tion during the radiation of ruminants. 


Evolution of dentition 

Ruminants have specialized dentition patterns, 
lacking upper incisors and having a high prev- 
alence of high-crowned or hypsodont teeth—a 
likely adaptation to abrasive diets such as grass or 
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Fig. 5. Genomic features related to ruminant characteristics. (A) Pairwise 
Spearman correlations of gene expressions indicated that the omasum, 
rumen, and reticulum evolved as an extension of the esophagus, whereas 
the abomasum evolved as an extension of the duodenum. Although it is 
anatomically closer to the reticulum, the omasum has a more similar gene 
expression atlas with that of the rumen than the reticulum, which mirrors the 
similar functions between them. (B) Diagram of the convergent feature of 
keratinous sheath in the Bovidae and Antilocapridae. The two red amino acids 
indicate convergent mutations of the KRT82 between Antilocapridae and 
Bovidae, and the red dots indicate the included species of Antilocapridae and 
Bovidae. (C) Body size contrast among ruminants and 11 genes related with 
bone development that have at least four specific mutations in giraffe and 
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are involved in TGF-B, Hedgehog, Notch, Wnt, and FGF pathways. (D) A total 
of 642 genes in GO related to the development of body size are retrieved, 
and we calculated the dN/dS ratios on the branches, leading to large, 
medium, and small body sizes, under the two-ratio branch model (model 2) 
of PAML. Background dN/dS ratios are calculated under one-ratio branch 
model (model 1) of PAML. The distribution densities of the dN/dS values are 
shown. A box plot of dN/dS values in different body size categories is 
shown. The mean dN/dS values at the branches of large body size and 
small body size are significantly larger than background. ***P < 0.01. 

(E) Antilocapridae and Antilopinae species are among the most mobile 
land mammals, and both of them have specific mutations in several genes 
of the mitochondrial electron transport chain. 
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feeding under abrasive conditions, as encountered 
in dry habitats (107). The distinct dentition is 
believed to have contributed to the evolutionary 
success of the ruminants by expanding their food 
sources (108). We observed that 11 genes related 
to dentition had RSCNEs in the 10-kbp down- 
stream or upstream or in introns (table $42). 
Among these genes, ENAM functions in the min- 
eralization and structural organization of enamel 
(109) and is possibly associated with enamel thick- 
ness in humans (770). Furthermore, ENAM is an 
important factor in the adaptation of dentition 
to specific diets (711). We observed seven RSCNEs 
in this gene region (fig. S48A). In addition to 
these RSCNEs, ENAM had a two-amino acid 
insertion and several mutations that were spe- 
cific to ruminants (fig. S48, B and C). A genome- 
wide association analysis using the hypsodonty 
index of ruminants [data from (172)] identified 
one SNP significantly associated (Fisher’s exact 
test, P < 0.01) with the hypsodonty index located 
in the binding site of the EBF/ transcription fac- 
tor in the intron of FGF/4 (fig. S49), an important 
factor controlling tooth development (173). These 
genes provide candidates for future experimental 
studies on mammal dentition. 


Conclusion 


A well-resolved phylogeny of Ruminantia with 
full genome data provides an opportunity to 
understand the evolutionary processes and 
genetic basis of the distinct structure and evo- 
lutionary patterns of ruminants. Our compre- 
hensive evolutionary and comparative analyses 
have revealed numerous genetic variations cor- 
related with specific traits in ruminants. This 
study provides valuable genomic resources as 
well as insights into not only the evolution and 
diversification of ruminants but also our under- 
standing of mammalian biology. 


Materials and methods 


We sequenced and assembled 44 ruminant 
genomes using Illumina reads, with the black 
muntjac (Muntiacus crinifrons) further se- 
quenced with PacBio SMART long reads. The 
assemblies were performed with SOAPdenovo 
v1.05 (1/4), Platanus v1.2.4 (115), and Supernova 
assembler (176), and some contigs were further 
scaffolded with SSPACE v3.0 (117) and “cross_ 
genome” commands in the Phusion2 package 
(118). Repeat elements were annotated by com- 
bining results of Tandem Repeat Finder v4.07b 
(119), RepeatMasker v4.0.5 (120), RepeatModeller 
v1.0.4 (120), and LTR_FINDER V1.0.6 (127). Genes 
were annotated with homologous TBLASTN (122) 
protein searches of human (Ensembl 87 release), 
cattle (Ensembl 87 release), and sheep (Ensembl 
87 release), combining de novo prediction of 
SNAP (123), GENSCAN v1.0 (124), glimmerHMM 
v3.0.3 (125), and AUGUSTUS v2.5.5 (126). Whole- 
genome alignments were constructed with LAST 
v885 (127) and MULTIZ v11.2 (128), using goat as 
the reference. The maximum likelihood phylo- 
genic trees were constructed with RAXML v8.2.9 
(129) and ExaML v3.0.17 (130) by using a gene- 
ral time-reversible model (“GTR+GAMMA’”). 
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ASTRAL III v5.1.1 (36) and MP-EST v2.0 (38) were 
used to perform coalescent species tree estima- 
tions. DiscoVista v1.0 (137) was used to quantify 
and visualize gene tree discordance for alterna- 
tive topologies. Gene flows among ruminant fami- 
lies were performed with ABBA-BABA test (39), 
Dror, (40), admixturegraph v1.0.2 (41), PhyloNet 
v3.6.9 (42), and PhyloNetworks v0.9.0 (43). Time 
calibration was conducted with r8s v1.70 (132), 
BEAST v1.8.4 (133), MCMCTREE in PAML v4.8 
(134), and Multidivtime (735). Demographic his- 
tory reconstruction was carried out by means 
of PSMC analysis (56). PhyloFit v1.4 (736) and 
phastCons v1.4 (137) were used to infer conserved 
nonexonic elements in ruminants. PSGs and 
REGs were identified by use of branch and branch- 
site models in PAML v4.8 (134). The gene family 
expansion or contraction analysis was performed 
by using CAFE v4.7 (138). In-house scripts and 
pipelines are deposited in Zenodo (https://zenodo. 
org/record/2549147). Detailed methods and mate- 
rials are described in the supplementary mate- 
rials (23). 
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Genetic basis of ruminant headgear 
and rapid antler regeneration 
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Guojie Zhang, Yu Jiang}, Wen Wang{, Qiang Qiut 


INTRODUCTION: All pecoran families, except 
the Moschidae, have cranial appendages or 
headgear, a unique structure among mam- 
mals that has a different morphology in each 
family (ossicones in giraffids, pronghorns in 
pronghorn, antlers in cervids, and horns in 
bovids). Moreover, the deer antler is the only 
completely regenerable organ found in mam- 
mals, thus providing a unique model for regen- 
erative biology. Antlers also have extremely 
rapid growth rates (~1.7 cm/day in red deer), 
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with rates of cell proliferation that surpass 
even cancerous tissue growth. Cervids also 
have low cancer rates. The relation between a 
tight regulation of antler growth and inhibi- 
tion of oncogenesis in deer may provide in- 
sights for cancer prevention and therapy in 
humans and other organisms. 


RATIONALE: We obtained 221 transcriptomes 


from bovids and cervids and sequenced three 
genomes representing the two pecoran lin- 


Antler 


Neural crest cellular origin of ruminant headgear and the tight control of rapid antler 
regeneration and low cancer risk in cervids. (Left) Phylogenomic relationships of the 

six ruminant families. Anatomic features of family-specific headgear are depicted, showing that 
headgear of ruminants share tissue and cellular origins. (Upper right) The gene expression 
profile of antler correlates more strongly with osteocarcinoma than with normal bone tissue. 
(Lower right) The balance between rapid antler regeneration, which depends on genes in 

the oncogenic pathway, and reduced cancer risk, which may involve adaptive evolution of 


tumor suppressor genes. 
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eages that convergently lack headgear. Com- 
paring the data with a large set of ruminant 
genomes, including nine cervids, we detect 
genetic changes (lineage-specific positively 
selected genes and conserved elements) in 
pecorans with headgear (PWH), particularly 
cervids. Using the observed genetic changes 
and gene expression in headgear, we explore 
the genomic basis of ruminant headgear ori- 
gin and antler regeneration. 


RESULTS: We find that highly or specifically 
expressed genes in horns and antlers are most 

frequently coexpressed in 
bone, skin, nerve tissues, 
Read the full article and testis. Many genes 
at http://dx.doi. under positive selection in 
org/10.1126/ PWH (e.g., OLIGI, OTOP3), 
science.aav6335 PWH-specific genes asso- 
ciated with highly con- 
served elements (e.g., HOXD gene cluster, 
SNAI2, TWISTI, SOX), and genes highly or 
specifically expressed in headgear (RXFP2, 
SOX10, NGFR) are involved in neural functions. 
In addition, RXFP2, which is specifically ex- 
pressed in headgear and testis, was conver- 
gently pseudogenized in the headgearless 
lineages of Moschidae and Hydropotinae. The 
expression profile of antler is more correlated 
with osteocarcinoma than with normal bone 
tissue expression profiles. A number of proto- 
oncogenes (FOS, REL, FAM83A) and tumor 
suppression genes have been positively selected 
in cervids, especially several cofactor genes 
(PML, NMT2, and CD2AP) and regulator genes 
(ELOVL6, SI00A8, ISGI5, CNOT3, and CCDC69) 
of the p53 tumor suppressor, suggesting that 
these adaptive changes may enhance cancer 
resistance in deer. 


CONCLUSION: Together, the phylogeny, gene 
expression profiles, and convergent headgear 
losses support a single evolutionary origin of the 
ruminant headgear. Pecoran headgear likely 
share a common cellular origin from neural 
crest stem cells, and the determination of the 
chondrogenic and neural lineages is important 
for headgear development. In addition, cervid- 
specific genetic changes in tumor suppressor 
and proto-oncogenes imply that the regen- 
erative properties of antler tissue exploit onco- 
genesis pathways. Our study reveals genetic 
mechanisms underlying the evolutionary, de- 
velopmental, and histological origin of rumi- 
nant headgear, as well as antler regeneration. 
The identified genes and their unique muta- 
tions provide guidelines for future functional 
studies of headgear development, regeneration 
of mammalian organs, and oncogenesis. 
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Genetic basis of ruminant headgear 
and rapid antler regeneration 


Yu Wang", Chenzhou Zhang”*, Nini Wang", Zhipeng Li®*, Rasmus Heller**, 

Rong Liu®**, Yue Zhao, Jiangang Han™, Xiangyu Pan’, Zhuqing Zheng", 

Xueqin Dai*®, Ceshi Chen*’®, Mingle Dou’, Shujun Peng’, Xianqing Chen”, Jing Liu’, 
Ming Li’, Kun Wang”, Chang Liu”, Zeshan Lin”, Lei Chen”, Fei Hao®, Wenbo Zhu’, 
Chengchuang Song’, Chen Zhao’, Chengli Zheng’, Jianming Wang’, Shengwei Hu’®, 
Cunyuan Li’®, Hui Yang®, Lin Jiang’, Guangyu Li®, Mingjun Liu”, Tad S. Sonstegard’”, 
Guojie Zhang®"*"**, Yu Jiang’+, Wen Wang”®"*+, Qiang Qiu’+ 


Ruminants are the only extant mammalian group possessing bony (osseous) headgear. 
We obtained 221 transcriptomes from bovids and cervids and sequenced three genomes 
representing the only two pecoran lineages that convergently lack headgear. Comparative 
analyses reveal that bovid horns and cervid antlers share similar gene expression 

profiles and a common cellular basis developed from neural crest stem cells. The rapid 
regenerative properties of antler tissue involve exploitation of oncogenetic pathways, and 
at the same time some tumor suppressor genes are under strong selection in deer. These 
results provide insights into the evolutionary origin of ruminant headgear as well as 
mammalian organ regeneration and oncogenesis. 


uminants are the only group of extant 
mammals with osseous cranial append- 
ages, which are collectively termed head- 
gear (1). Osseous headgear are exclusively 
found in the pecorans (all ruminants, 
excluding Tragulidae), a group that radiated 
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~23.3 million to 20.8 million years (Ma) ago (2) 
(Fig. 1). Each family in the pecoran group ex- 
hibits a distinct headgear morphology (3). The 
ossicones of Giraffidae consist of bony protu- 
berances covered only by skin and hair. The 
pronghorns of Antilocapridae are composed of 
bone covered by skin, hair, and an annually de- 
ciduous forked keratinous sheath. The horns of 
Bovidae also have a bony core but are covered by 
a nondeciduous, nonforked keratinous sheath. 
The antlers of Cervidae are wholly deciduous, 
regenerating annually as an outgrowth of bone 
from the frontal skull. Despite this variation, all 
these types of pecoran headgear (including those 
of extinct species) share characteristic features, 
such as their frontal cranial position and a bony 
core covered by integument (fig. S1). The evo- 
lutionary origin of headgear, specifically whether 
headgear evolved only once or multiple times, 
has been a matter of considerable scientific dis- 
cussion (1). Resolving this has proved challeng- 
ing because of the lack of consensus regarding 
the family-level phylogeny of the ruminants. 


Results 


A comprehensive phylogenetic analysis of the 
ruminants (2) resolved the family-level topology 
and shows that the most phylogenetically par- 
simonious hypothesis is a single origin of head- 
gear in pecorans followed by two independent 
losses (Fig. 1). Multiple independent origins 
would be at odds with the rapid radiation of the 
five Pecora families, which took place during an 
~2.5-million-year interval (23.3 to 20.8 Ma ago), 
and furthermore it is difficult to explain why 
headgear would have evolved multiple times in 
pecorans yet be absent in all other mammalian 
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taxa. The Ruminant Genome Project (2) provides 
an opportunity to investigate the genomic back- 
ground of ruminant headgear evolution and ad- 
dress its implications in organ regeneration. 


Shared gene recruitment in horns 
and antlers 


The evolutionary origin of any new organ typi- 
cally depends on the recruitment of genes that 
were originally expressed in other tissues (4). 
To identify genes recruited in bovid horns, we 
compared 181 transcriptomes obtained in this 
study—representing 16 tissues and including 
7 transcriptomes of goat horn sprouts, 61 of other 
goat tissues, 3 of sheep horn sprouts, and 110 of 
other sheep tissues—and added 49 published 
sheep transcriptomes (5) (table S1). In addition, 
transcriptomes from two fetal sheep horn buds 
and adjacent frontal skin tissues were sequenced 
to identify differentially expressed genes (DEGs) 
between these two tissue types at this important 
developmental stage. For cervid antlers, we se- 
quenced 20 roe deer (Capreolus capreolus) and 
20 sika deer (Cervus nippon) samples represent- 
ing 16 tissues, including neonatal antlers (table 
S1). Genes specifically expressed in headgear 
tissues (hereafter headgear-specific genes) were 
defined as those that have a t index exceeding 
0.8 and are expressed most strongly or second 
most strongly in headgear tissues (5). We identi- 
fied 624 horn-specific genes (table S2), and these 
were most highly coexpressed in bone, skin, 
testis, and brain tissues (Fig. 2A and fig. S2A). 
We also identified 761 antler-specific genes that 
were most highly coexpressed in the same four 
tissues (Fig. 2A, fig. S2B, and table S3). In addi- 
tion, 201 headgear-specific expression genes were 
shared by both antler and horn tissues (fig. S3A), 
and these genes were enriched in bone develop- 
ment, skin development, and neurogenesis path- 
ways (Fisher's exact test, adjusted P value < 
1x 10) (fig. S3B and table $4). The DEGs (table $5) 
between fetal sheep horn bud and adjacent frontal 
skin tissues were enriched in nerve development 
pathways (fig. S4 and table S6). Histological anal- 
ysis of cattle fetal horn buds suggested that, in 
contrast to the frontal skin of polled fetuses, neu- 
ral tissue exists only in the horn buds (6). Overall, 
the gene expression results suggest that the de- 
velopment of both kinds of headgear considered 
here (horns and antlers) depends on similar gene 
expression profiles, largely recruited from nerve, 
bone, and skin tissues. 


A common genetic and cellular basis 
of headgear 


To identify the genetic basis of headgear evolu- 
tion, we used the branch-site likelihood ratio 
test (5) to detect positively selected genes shared 
only by pecorans with headgear (PWH). A total of 
240 genes were identified as positively selected 
in PWH (table S7). Enriched functional Gene 
Ontology (GO) categories of these genes included 
biomineral tissue development (Fisher's exact 
test, adjusted P value = 4.9 x 10°”) (table S8), 
providing further evidence that the evolution of 
osseous headgear depends on bone development. 
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Fig. 1. Phylogenomic placement of species without headgear in the Ruminantia and Cervidae. (Left) The phylogenomic relationships of the 

six ruminant families from (2). (Right) Maximum-likelihood tree for the nine studied cervid species obtained using 3,316,385 four-fold degenerate sites. 
The anatomic structure of headgear in each family is depicted, including the keratin sheath of Antilocapridae and Bovidae and the regenerable antler 
of Cervidae. The red bar indicates secondary headgear loss and the red text highlights the de novo assembled species in this study. The number of 
species in each family used in this study is indicated in parentheses. Sources and credits for species photos are listed in table S26. 


Among these genes, we found that OTOP3 was 
under positive selection in PWH along with a 
headgear-specific gene (fig. S5 and tables S2, S3, 
and S7). OTOP3 is a member of the otopetrin 
gene family, which regulates biomineralization 
processes (7). Of the eight Pecora-specific amino 
acid mutations in the OTOP3 protein, four are 
located in the otopetrin functional domain 
(PF03189) (fig. S5). OTOP3 is expressed in the 
neural crest of Xenopus embryos, suggesting a 
role in neural crest function (8). OLIGI (fig. S6), 
another positively selected gene in PWH, is also 
related to neural crest differentiation pathways 
(9). A previous study showed that a 212-base pair 
(bp) duplication (~65 kbp) flanking the OLIG1 
gene region is the causal mutation of polled 
cattle, i.e., cattle that completely lack horns (0). 
We detected nine distinct amino acid changes 
in OLIGI in the inferred common ancestor of 
Pecora, resulting in domain structure changes as 
revealed by protein structure homology model- 
ing (fig. S6). Given that the frontal cranial bones 
are derived from cells of the cranial neural crest 
(11), changes in the OTOP3 and OLIGI genes 
likely played crucial roles in the evolution and 
development of pecoran headgear (Fig. 2B). 
We also identified 8732 lineage-specific highly 
conserved elements (HCEs) (=20 bp) (table S9) 
in PWH using the phylogenetic hidden Markov 
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model approach (5). The PWH-specific HCE- 
associated genes are enriched in the signaling 
pathway regulating the pluripotency of stem cells 
(Fisher's exact test, adjusted P value = 2.5 x 10°°) 
and transforming growth factor-B (TGF-f) (Fish- 
er’s exact test, adjusted P value = 3.5 x 10° 
(table S10). The TGF- signaling pathway plays 
an important role in bone formation and the 
regulation of cranial neural crest cell prolifera- 
tion during frontal bone development (12). The 
PWH-specific HCE-associated genes SNAI2, 
TWISTI1, SOX9, and the HOXD gene cluster are 
involved in neural crest cell migration (9) (Fig. 2B 
and table S9). Specifically, we identified a PWH- 
specific 25-bp HCE 15 kbp downstream from the 
HOXD gene cluster (fig. S7A) that serves as a 
master regulator in neural crest patterning, par- 
ticularly in the cranial region (13). Further anal- 
ysis indicates that this element resulted from a 
3.6-kbp Pecora-specific transposable element in- 
sertion (fig. S7, B and C) located in the candidate 
region causing the four-horned phenotype in 
sheep (/4). These results suggest that the evolu- 
tion of PWH-specific regulatory elements may 
also play a role in reprogramming neural crest 
cells to develop into headgear. 

In addition, we found that six neural crest cell 
migration-related genes (SOX10, SNAII, SNAI2, 
TFAP2A, NGFR, and COLIIA2) are specifically 


expressed in headgear (tables S2 and S3). SOX10, 
SNAI/, and TFAP2A are highly expressed in fetal 
sheep horn buds but not in adjacent skin tissue 
(fig. S4 and table S5). Notably, SOX70 and NGFR 
are used as marker genes of neural crest cells 
(15), and our immunohistochemical analysis 
confirmed that SOX70- and NGFR-positive cells 
are present in the embryonic horn bud of sheep 
(fig. S8). In addition, two headgear-specific genes, 
FOXL2 and TWIST], are related to horn abnormal- 
ities (16, 17) (tables S2 and S3) and both interact 
with SOX9, a marker gene for the determination 
of the chondrogenic lineage in the cranial neural 
crest (18). From the results of our comparative 
genomic analysis and previous findings regard- 
ing horn-related genes (10, 14, 16, 17), we conclude 
that pecoran headgear likely share a common 
cellular origin in neural crest stem cells (Fig. 2B). 
This supports a single evolutionary origin for 
pecoran headgear despite their morphologi- 
cal diversity. 


Convergent pseudogenization led to 
secondary loss of headgear 


We supplemented the 51 ruminant genomes in the 
Ruminant Genome Project (2) with a high-quality 
reference genome from a secondarily antlerless 
species, the Chinese water deer (Hydropotes 
inermis) of the cervid subfamily Hydropotinae 
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Fig. 2. Gene recruitment and cellular origin of ruminant headgear. 

(A) Genes recruited to headgear from different organs. Bone, skin, testis, 
brain, and others are marked as orange, purple, yellow, green, and various 
shades of gray, respectively. (B) Diagram of neural crest cell genes 
involved in headgear development. The HOXD gene cluster is depicted as 
the hypothesized master regulator of headgear. Genes annotated in the 
neural crest cell migration and differentiation pathway are labeled with 


(fig. S9 and tables S11 to S13), and two contig- 
level genomes of Moschidae species (Moschus 
chrysogaster and M. leucogaster) (table S14). 
From the high-confidence phylogenetic tree ob- 
tained with whole-genome data (Fig. 1), it is clear 
that Moschidae and Hydropotinae have inde- 
pendently lost their headgear. To explain this 
convergent feature and learn more about genes 
controlling headgear, we investigated pseudo- 
genization in the shape of premature stop codons 
and frameshifts in both of these lineages (Fig. 1 
and tables S15 and S16). Of 289 pseudogenes 
identified in these two distant lineages, RXFP2 
was the only headgear-specific gene that was 
convergently pseudogenized, although the muta- 
tions occur at different sites of the RXFP2 gene 
in these lineages (fig. S10). RXFP2, which our 
transcriptomic data indicate was recruited from 
testis tissue into horn and antler, is highly ex- 
pressed in the fetal sheep horn bud (fig. S10) and 
has two PWH-specific HCEs in its intron region, 
one of them with a binding motif for the horn- 
specific gene ARNT (fig. S11). Previous studies 
identified RXFP2 as a sexually selected gene as- 
sociated with horn morphology in bighorn sheep 
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(Ovis canadensis) (19, 20). Additionally, a 1.8-kbp 
insertion in the 3’ untranslated region (3’UTR) of 
RXFP2 is associated with lack of horns in sheep 
(21). Collectively, these results indicate that 
pseudogenization of RXFP2 is the most likely 
functional mechanism behind the convergent 
secondary loss of headgear in the Moschidae and 
Hydropotinae lineages. 


Neural processes involved in 
antler regeneration 


The deer antler is the only completely regenerable 
organ found in mammals and thus provides a 
unique model for regenerative biology. Antler 
regeneration is a stem cell-based process (22), 
and we demonstrated that antler stem cells may 
originate from cranial neural crest cells, which 
have the potential to rapidly proliferate and dif- 
ferentiate into cartilage and neural cells (Fig. 2B). 
Notably, growing antlers are richly innervated 
by sensory fibers, and resection of the antler 
pedicle sensory nerve markedly reduces antler 
regeneration and stunts antler size (23). Both 
antler-specific expression genes (table $3) and 
cervid-specific HCE-associated genes (table S17) 
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are enriched in annotation terms associated 
with the axon guidance pathway, particularly 
the genes coding for key guidance molecules 
for axon growth: slits, ephrins, and semaphorins 
(Fig. 3A and tables S18 and S19). In addition, the 
top eight rapidly evolving genes in cervids with 
adjacent HCEs are all related to neural functions 
(fig. S12 and table S20). We also found that the 
nerve growth factor receptor (NGFR) gene is 
strongly and specifically expressed in antlers 
(table S3). This is consistent with findings that 
neural growth factors promote the growth of 
antler nerves (24), and these lines of evidence 
corroborate the involvement of neural processes 
in annual antler regeneration. 


Similarities between antler growth and 
cancer cell growth programs 


Antlers grow extremely quickly, as exemplified 
by red deer antlers, which have average growth 
rates of 1.7 cm/day and can reach a weight of up 
to 30 kg (25). These antlers regrow annually 
(from spring to summer) owing to very fast cell 
proliferation (Fig. 3B) that surpasses even can- 
cerous tissue growth (25). Antler growth mainly 
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Fig. 3. Oncogenesis and 
neurogenesis pathways 
involved in rapid antler 
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proceeds by chondrocyte proliferation and ossi- 
fication (Fig. 3C) and thus provides a research 
model for osteosarcoma. We found a higher cor- 
relation between the gene expression profiles 
of antler and osteosarcoma (r = 0.67 to 0.78) 
than between those of antler and normal bone 
tissues (r = 0.33 to 0.47) (fig. S13), showing sim- 
ilar patterns of developmental programs in antler 
growth and oncogenesis. 

We found evidence that three proto-oncogenes 
(FOS, FAM83A, and REL) were under positive 
selection in the cervid ancestor (fig. S14 and table 
$21). Of these, FOS acts as a downstream growth 
factor signaling pathway that regulates cell pro- 
liferation and differentiation. Overexpression of 
FOS induces osteosarcoma formation in mice via 
the transformation of chondroblasts and osteo- 
blasts (26). Additionally, FAM83A has been iden- 
tified as an oncogene involved in the epidermal 
growth factor receptor (EGFR) signaling path- 
way (27). We also observed antler-specific expres- 
sion of five growth factor and receptor genes 
(FGF19, FGF21, FGFBP3, PDGFD, and PDGFRL) 
that play important roles in driving cancer cell 
proliferation and survival (28) (table S3). In addi- 
tion, a cervid-specific HCE is located in the 3'UTR 
of NOVAI, which is believed to activate telomerase 
and promote tumor growth in vivo (29) (figs. S15 
and S16 and table S17). Taken together, these cell 
growth-associated cervid-specific changes and 
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the expression of tumor promoters in antlers 
(Fig. 3D) indicate that the rapid proliferation of 
cells required for rapid antler growth has simi- 
larities with cancer cell growth programs. 


Regulation of antler growth may confer 
cancer resistance 


Cancer frequency records from both the Philadelphia 
and San Diego zoos indicate that cancer incidence 
rates are ~5 times lower in cervids than in other 
mammals (0.4 to 0.8% and 2.1 to 4.6%, respec- 
tively) (30, 31). This tentatively suggests that the 
precisely regulated cell growth regulators required 
for controlled rapid antler regeneration may con- 
fer protection against the development of cancers 
in cervids because of specific genetic changes 
relevant to cancer avoidance. Accordingly, Kyoto 
Encyclopedia of Genes and Genomes (KEGG) anal- 
ysis of DEGs between antler and osteosarcoma 
shows enrichment for cancer- and metabolism- 
related pathways (fig. S13B and table S22). 

We also observed that many tumor suppres- 
sor genes are under positive selection (table S21) 
and strongly expressed in antlers. Among these, 
the tumor suppressor gene PML has 11 cervid- 
specific nonsynonymous changes and carries the 
strongest signal of positive selection detected in 
cervids (Fig. 4A and table S21). PML is a transcrip- 
tional coactivator of p53, and its overexpression 
enhances p53 transcriptional activation and leads 


to cell growth arrest (32). The TP53 (encoding the 
protein p53) signaling pathway plays a central 
role in regulating cell division and preventing 
tumor formation (33). We observed that three 
p53 cofactor genes (PML, NMT2, and CD2AP) 
and five p53 regulator genes (ELOVL6, SI00A8, 
ISG15, CNOT3, and CCDC69) were under posi- 
tive selection in the cervid lineage and expressed 
in antlers (Fig. 4B and table $21). We also noticed 
that TP53 itself was identified as a rapidly evolv- 
ing gene in the cervid lineage from the evolu- 
tionary analysis of 51 ruminant genomes (2). 

In addition to the 7P53 pathway-related genes, 
we also observed several other tumor suppres- 
sor genes that were under selection in the cervid 
lineage and expressed in antlers. One such gene, 
ADAMTSIS8 (Fig. 4A), belongs to the ADAMTS 
family, which encompasses disintegrin and 
metalloproteinase-like proteinases that inhibit 
growth of carcinoma cells by controlling the 
structure and function of the extracellular matrix 
(ECM) and regulating the tumor microenviron- 
ment (34). The components and function of the 
ECM are known to play an important role in can- 
cer resistance in the naked mole-rat (35). In addi- 
tion, the ADAMTS family members ADAMTS2, 
ADAMTS4, ADAMTS12, ADAMTS14, ADAMTSI7, 
and ADAMTSI8 are not only all antler-specific 
genes (fig. S17) but are also more highly expressed 
in antler than in osteosarcoma (fig. S13C). 
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Fig. 4. Examples of positively selected tumor suppressor genes in cervids. (A) Gene models showing cervid-specific mutations of two positively 
selected tumor suppressor genes, PML and ADAMTSI8. PML has the strongest selection signals detected in cervids (likelihood ratio test, P value = 1.63 x 10-©) 
(table S21). (B) Genes positively selected in the p53 signaling pathway of cervids. Selected p53 cofactors are highlighted with yellow hexagons, and selected 


regulators are marked with green ovals. 


Finally, several genes involved in DNA damage 
response pathways showed signatures of cervid- 
specific evolution in our transcriptomic and com- 
parative genomic analyses. 7P73 and TP53113 
suppress tumors through their roles in the p53- 
mediated DNA damage response pathway (36) 
and are specifically expressed in antlers (table S3). 
Moreover, we found that three more DNA dam- 
age response genes (SLFI, RHNOJ, and DDB2) 
were under positive selection in the cervid lin- 
eage (table S21). 

The cervid-specific expression and genetic 
changes in these tumor suppressor and DNA 
repair genes may play important roles in the fine- 
tuned regulation of rapid antler regeneration, 
while at the same time preventing the onset of 
cancers. Further detailed functional studies may 
therefore be of great scientific significance in 
demonstrating the mechanisms underlying rapid 
but controlled cell growth and exploring the po- 
tential of cervids as a cancer model. 


Discussion 


Headgear, or cranial appendages (antlers in cervids, 
horns in bovids, pronghorns in pronghorn, and 
ossicones in giraffids), are conspicuous and diverse 
features of the Pecora lineage within Ruminantia 
and are unique among all mammals. Headgear 
evolution is still debated, partly because of dif- 
ferences in the evolutionary scenarios suggested 
by different phylogenic trees, coupled with the 
notable differences in headgear anatomy and 
development (7). Therefore, it has been proposed 
that pecoran headgear could have multiple inde- 
pendent origins. We show that the most parsi- 
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monious explanation is a single headgear origin 
(2). Our comparative genomic and transcriptomic 
analyses indicate that horns and antlers are very 
similar in their gene expression profiles and that 
pecoran headgear share cellular origins from 
neural crest stem cells. Moreover, the headgear- 
specific gene RXFP2 was convergently pseudo- 
genized in secondarily headgearless lineages, 
corroborating the single evolutionary origin hy- 
pothesis by providing a simple genetic mecha- 
nism for the otherwise puzzling convergent loss 
of headgear. Notably, the extremely rapid growth 
and peculiar regeneration of cervid antlers are 
biological features of interest. For instance, an 
incision in an antler tine results in a slight scar 
that in the following year leads to a small tine, 
whereas injury to the pedicle leads to perma- 
nent inhibition of either pedicle or antler growth 
(37, 38). Furthermore, it has been shown that 
electrical stimulation of antler nerves increases 
antler length and weight (39), suggesting that 
neural tissue plays an important role in antler 
regeneration. This link was corroborated by the 
neural associations of several antler-specific genes 
and cervid-specific HCE-associated genes identi- 
fied in this study. Further integrated functional, 
physiological, and transcriptomic analyses of a 
wider range of samples during various antler 
growth stages are warranted to clarify the roles 
of these antler-specific neural genes in develop- 
ment and their potential utility in regenerative 
medicine and in vitro organ regeneration. 

The antler is an exclusively male trait in cervids 
(except reindeer), which has been strongly se- 
lected by sexual selection, and some species most 
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likely became extinct as a result of exaggerated 
antler size (40). Our data suggest that fast-growing 
cervid antlers have expression profiles that are 
more similar to those of osteosarcoma than to 
those of normal bone tissues (fig. S13). On the 
other hand, cervids have much lower cancer inci- 
dence than other mammals (30, 37). It is conceiv- 
able that natural selection might have selected 
for efficient cancer-defense mechanisms in deer. 
It has previously been shown that elephants have 
a reduced cancer risk because of functional du- 
plicates of the master tumor suppressor 7P53 (4). 
In contrast, cervids have a single copy of TP53, 
but other genes (PML, NMT2, CD2AP, ELOVL6, 
S100A8, ISGI5, CNOT3, and CCDC69) functioning 
in the p53 pathway are under positive selection, 
suggesting that cervids may have evolved an 
enhanced 7P53 signaling pathway to constrain 
tumor growth. Elephants and deer may therefore 
have independently evolved different strategies 
to avoid cancer by targeting the same central 
tumor-controlling p53 regulatory pathway. We 
also found evidence that other tumor suppres- 
sor genes and proto-oncogenes have been under 
strong positive selection in cervids and/or are 
strongly and specifically expressed in the antler 
(e.g., ADAMTS18, FOS, REL, and FAM83A). Our 
study reveals the genetic mechanisms underly- 
ing the evolutionary, developmental, and histo- 
logical origins of pecoran headgear and provides 
insights into the molecular mechanisms of re- 
generation of deer antler and its relevance to 
cancer resistance. The identified genes and 
their specific mutations provide a starting 
point for future functional studies of headgear 
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development, regeneration of mammalian organs, 
and oncogenesis. 


Materials and methods 


We sequenced and collected 221 and 49 tran- 
scriptomes, respectively, including 20 of roe deer, 
20 of sika deer, 68 of goat, and 162 of sheep. RNA- 
sequencing reads were aligned with HISAT2 
v2.0.3 (42), and the gene expression levels were 
quantified with StingTie v1.2.2 (43). Tissue-specific 
expressed genes were calculated with the t index 
method (44). The DEGs in the embryonic horn 
bud were identified with DESeq2 v1.20.0 (45). 
Immunohistochemical analysis was performed 
to detect the expression of neural crest marker 
genes NGFR and SOX10 in the embryonic horn 
bud. The genome of the Chinese water deer 
(H. inermis) was sequenced using 10X genomic 
platform and assembled with Supernova soft- 
ware V1.2.1 (46). Two contig-level assemblies of 
Moschidae species were obtained by Illumina 
HiSeq 2000 sequencing and assembled using 
the SOAPdenovo software v2.04 (47). Pseudogenes 
of the three headgearless genomes were iden- 
tified with GeneWise v4.0 (48). Whole-genome 
alignments of 54 ruminant species were carried 
out with LAST vlast867 (49) and multiz v11.2 (50) 
using goat as the reference genome. Positively 
selected genes were identified by using the 
branch-site model in PAML v4.9e (51). PWH- 
specific HCEs were identified with phastCons 
v1.4 (52). Elements showing accelerated evo- 
lution in Cervidae were detected using phyloP 
v1.4 with the CONACC mode and LRT test 
method (52). 
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INTRODUCTION: Reindeer (Rangier tarandus) 
are naturally distributed across the Arctic and 
subarctic regions. Consequently, these animals 
have evolved to face numerous challenges, in- 
cluding exposure to severe cold, limited food 
availability in winter, and extremely prolonged 
light or dark periods. Unlike all other cervid 
species, both male and female reindeer annually 
grow deciduous antlers. Furthermore, reindeer 
are the only fully domesticated species among 
the Cervidae. However, little is known about the 
underlying genetic causes of these traits. 


RATIONALE: We performed comparative ge- 
nomic analyses between reindeer, other rumi- 
nant species, and a number of mammalian 
outgroups to identify rapidly evolving genes, 
positively selected genes, and reindeer-specific 
mutants. We further resequenced the genomes 
of three domestic reindeer from northern China 


Mutations in vitamin D metabolism 
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and three wild reindeer from Northern Europe 
to validate that the reindeer-specific mutations 
are fixed in the species rather than individual 
polymorphisms. To support our computation- 
ally derived insights, we subsequently conducted 
in vitro functional experiments to investigate 
possible functional consequences of some of the 
reindeer-specific mutated genes. 


RESULTS: We found two genes (CYP27B] and 
POR) involved in the vitamin D metabolism 
pathway to be under positive selection in rein- 
deer. Furthermore, our functional experiments 
validated that the two key enzymes (CYP27B1 
and POR) exhibit much higher catalytic activity 
than that of the orthologs in goats and roe deer. 
We also identified fixed reindeer-specific muta- 
tions in genes that play a role in fat metabolism, 
including APOB and FASN. We showed that a 
mutation upstream of the reindeer CCNDI gene 
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Unique mutations explain the biological adaptations of reindeer. (Left) Two genes (POR and 
CYP27B1) play an important role in vitamin D metabolism in reindeer. (Middle) A newly identified 
binding motif of the androgen receptor (AR) evolved upstream of a key antler CCND1 gene, which may 
result in female antler growth. bp, base pairs. (Right) A key reindeer-specific mutation (P1172T) in 
PER2 results in loss of binding ability with CRY1, which can cause the observed circadian arrhythmicity 
in reindeer. The circadian genes highlighted in red are positively selected in reindeer; the ones shown 
in blue are rapidly evolving genes. PACAP, pituitary adenylate cyclase activating polypeptide; SCN, 
suprachiasmatic nucleus; ER, endoplasmic reticulum; CREB, cAMP response element-binding 
protein; nNOS, neuronal nitric oxide synthase; P, phosphorus; G, G protein. 
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endows an extra functional binding motif to the 
androgen receptor and thus may result in fe- 
male antler growth. In the circadian rhythm 
pathway, we observed that eight genes have 
reindeer-specific mutations and that four genes 


have been rapidly evolv- 

ing. Among them, the 
1172 

Read the full article Pro ; Thr (P1721) muta- 

at http://dx.doi. tion in the reindeer PER2 

org/10.1126/ causes loss of binding abil- 


science.aav6312 ity with CRYI, which can 


cause arrhythmicity. Fi- 
nally, we found reindeer-specific mutations in 
11 genes relating to development, migration, 
and differentiation of neural crest cells, prob- 
ably accounting for the tameness of reindeer. 


CONCLUSION: Our results reveal the genetic 
basis of a broad spectrum of the Arctic deer’s 
traits and provide a basis for understanding 
mammalian adaptive strategies to the Arctic. 
Our comparative genomic studies and func- 
tional assays identify a number of genes that 
exhibit functionality related to circadian ar- 
rhythmicity, vitamin D metabolism, docility, and 
antler growth, as well as genes that are uniquely 
mutated and/or are under positive selection. 
Our results may provide insights relevant to hu- 
man health, including how the genetic response 
of vitamin D in reindeer affects bone and fat 
metabolism and how genes can affect circa- 
dian arrhythmicity. 


The list of author affiliations is available in the full article online. 
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The reindeer is an Arctic species that exhibits distinctive biological characteristics, for 
which the underlying genetic basis remains largely unknown. We compared the genomes of 
reindeer against those of other ruminants and nonruminant mammals to reveal the genetic 
basis of light arrhythmicity, high vitamin D metabolic efficiency, the antler growth trait of 
females, and docility. We validate that two reindeer vitamin D metabolic genes (CYP27B1 and 
POR) show signs of positive selection and exhibit higher catalytic activity than those of other 
ruminants. A mutation upstream of the reindeer CCND1 gene endows an extra functional 
binding motif of the androgen receptor and thereby may result in female antlers. Furthermore, 
a mutation (proline-1172-threonine) in reindeer PER2 results in loss of binding ability with 
CRY1, which may explain circadian arrhythmicity in reindeer. 


eindeer (Rangifer tarandus) are naturally 
distributed across the Arctic and subarctic 
regions. They face challenges such as se- 
vere cold and limited food availability dur- 
ing the winter and prolonged periods of 
both light and darkness during the year (J) and 
have thus had to evolve strategies and features to 
address these environmental obstacles. For ex- 
ample, reindeer do not exhibit 24-hour activity 
rhythms, and evidence suggests weak or even 
absent circadian organization (2-4), indicating 
that their internal biological clock may be adapted 
to the extreme light periodicity of the Arctic. These 
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animals have also developed a particular fat me- 
tabolism process, limit heat loss by peripheral 
vasoconstriction, and maintain a low resting 
metabolic rate (7). A further challenge is that 
despite having to cope with low levels of solar 
energy, reindeer need to meet the calcium de- 
mands of rapid antler growth, therefore imply- 
ing that they must have evolved a particularly 
high capacity for metabolizing vitamin D (5). 
Additionally, whereas archaeological evidence 
indicates that humans have exploited cervid 
species for millennia (6), R. tarandus is the only 
fully domesticated species within the Cervidae 
family (7). In this regard, the emergence of ex- 
tensive reindeer husbandry during the last cen- 
turies was of considerable importance for the 
development of human civilization in the high 
Arctic of Asia and Europe. Reindeer are main- 
tained in large domestic herds (7) that exhibit 
differences in activity patterns, tameness, and 
mitochondrial sequences from those of their 
wild relatives (7-9). However, despite the combi- 
nation of these features, little is currently known 
about their underlying genetic causes, something 
we aimed to address in this study. 


Reindeer genomes exhibit 
vitamin D-specific mutations 


Although the Ruminantia is an important group 
of terrestrial herbivores and its members have 
adapted to a wide range of terrestrial habitats, 
R. tarandus is the only species within the Cervidae 
that is widely distributed across the Arctic and 
subarctic regions. Because both reindeer sexes 
grow large antlers while inhabiting regions that 


undergo extended periods of low or no solar en- 
ergy, it is conceivable that this species requires 
efficient calcium metabolism and reabsorption. 
Levels of active vitamin D [10,25-(OH).Ds] in deer 
blood have been associated with antler growth 
rate (10, 11). We therefore recovered all 28 genes 
from the reindeer reference genome (72) that are 
involved in the vitamin D metabolism pathway and 
compared them to their orthologs in other ruminant 
species from our ruminant genome project (73) as 
well as other mammalian outgroups (/4), including 
the pig (Sus scrofa domesticus) (15), horse (Equus 
caballus) (16), and human (Homo sapiens) (17). 

We found two genes (CYP27BI1 and POR) to 
be under positive selection in reindeer [branch- 
site model in phylogenetic analysis by maximum 
likelihood (PAML), chi-square test, P < 0.05] (Fig. 
1A, fig. S1, and tables S1 and S2) (/4). Further- 
more, six genes (ACAT2, CYP5IAI, EBP, CYP2RI1, 
BK2, and TRPV5) exhibited reindeer-specific mu- 
tations (Fig. 1A, fig. S2, and table $3), and APOB 
was identified as a rapidly evolving gene [branch 
model in PAML, chi-square test, P < 0.05, a 
higher K,/K, ratio (ie., the ratio of the number 
of nonsynonymous substitutions per nonsynon- 
ymous site to the number of synonymous sub- 
stitutions per synonymous site)] (Fig. 1A and 
tables S4 and S5) (14). To explore whether these 
alleles were fixed in reindeer, we resequenced 
three domesticated reindeer from the Greater 
Khingan Mountains of the Inner Mongolia Auton- 
omous Region, China, and three wild reindeer 
from the Snghetta mountain area of Norway (14) 
and found that the mutant alleles were identical 
in all seven individuals (fig. S2). Note that all 
mutations described hereafter in other genes 
are also fixed in all seven reindeer. 

CYP27B1 and POR are enzymes that play key 
roles in triggering the active 10,25-(OH).D; from 
vitamin D (18). Our genomic comparisons of 
reindeer against other ruminants [including the 
closest cervid genome available, which is from the 
Siberian roe deer (Capreolus pygargus), GenBank 
Assembly accession: GCA 000751575.1] and out- 
groups revealed reindeer-specific mutations at 
three positions in both the CYP27Bi] (Fig. 1B and 
table S3) and POR genes (fig. S2 and table S3). 
We also conducted three-dimensional (3D) struc- 
ture simulations to examine the possible effects 
of these mutations on the enzyme structure using 
Phyre2 (19). We found that only the K282—N 
(K282N) mutation in CYP27B1 was close to the 
P450 functional domain (Fig. 1C), whereas the 
other mutated amino acids are not (Fig. 1C and 
fig. S3). Although these mutations are neither lo- 
cated in nor close to known functional domains, 
they may also affect the catalytic activities (20). 

To explore the functional relevance of these 
mutations, we synthesized reindeer, roe deer, 
and goat (Capra hircus) orthologs in vitro and 
tested their enzyme catalytic activities by mea- 
suring the activities in a reconstituted system 
consisting of the enzyme, substrate, and NADPH 
(the reduced form of NADP") (74). For CYP27B1, 
the reindeer protein variant exhibited significant- 
ly higher metabolic efficiency than that of goat 
and roe deer proteins (~2- and ~1.5-fold increase, 
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Fig. 1. Vitamin D metabolism in reindeer. (A) Alterations in the vitamin D metabolism pathways of 
reindeer. The genes involved in this pathway are labeled with different colors. Red genes are under 
positive selection, orange genes exhibit specific mutations in reindeer, and blue genes exhibit 
increased K,/K, values in reindeer. CoA, coenzyme A; h, Planck's constant; v, frequency. (B) Specific 
mutations in the CYP27B1 gene. There are three specific mutations (table S3), including one 
(K282N) in the P450 domain of CYP27B1. The CYP27B1 protein sequences of multiple species 
(indicated with different colors) were aligned, and the alignments of the K282N adjacent region are 
shown here. Single-letter abbreviations for the amino acid residues are as follows: A, Ala; C, Cys; 
D, Asp; E, Glu; F, Phe; G, Gly; H, His; |, lle; K, Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gin; R, Arg; S, Ser; 
T, Thr; V, Val; W, Trp; and Y, Tyr. (©) 3D structure simulation of reindeer CYP27B1 compared with 
that of cattle. The 3D structure of POR is provided in fig. S3. (D and E) Enzyme activities of reindeer 
CYP27B1 and POR compared with those of roe deer and goat in vitro. **P < 0.01 calculated from 


the t test. Error bars indicate SD. prot, protein. 


respectively; ¢ test, P < 0.01) (Fig. 1D). Surprisingly, 
the efficiency of the reindeer POR was ~20 and 
~6 times that of the goat and roe deer POR, re- 
spectively (Fig. 1E; t test, P < 0.01). These results 
indicate that reindeer have evolved a more efficient 
vitamin D metabolism pathway than that of other 
mammals. This change likely enables reindeer 
to procure the high levels of active vitamin D 
needed to sustain their metabolism and calcium 
absorption and to promote body fat oxidation 
required to survive in the Arctic and subarctic re- 
gions (21). We also noted that the Ankyrin repeat 
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region of the calcium receptor coding gene TRPV5, 
which is activated by active vitamin D and thus 
affects the reabsorption of calcium, exhibited 
reindeer-specific mutations (fig. S2). This indicates 
that both the up- and downstream pathways asso- 
ciated with vitamin D metabolism evolved in rein- 
deer to secure the supply of calcium and energy. 


Reindeer-specific mutations related 
to fat metabolism 


We further identified fixed reindeer-specific mu- 
tations in genes that play a role in fat metabolism. 


These genes include APOB, which participates 
in the transport of low-density lipoproteins, and 
FASN, which encodes fatty acid synthase, an 
essential enzyme for de novo lipogenesis (22) 
(fig. S2 and table S3). These genes have previ- 
ously been reported as targets of positive selection 
in both polar bears (Ursus maritimus; APOB) and 
Adélie penguins (Pygoscelis adeliae; FASN) (23). 
Although the selected sites were different from 
that seen in polar bears and Adélie penguins, this 
observation suggests that fat metabolism path- 
ways have undergone convergent evolution across 
homothermal polar animals via the independent 
modification of key fat metabolism genes in a 
species-specific manner. 


Mutations related to female 
antler growth 


It is believed that the growth of cervid antlers is 
either driven by or strongly regulated by an- 
drogens (24). Notably, the reindeer is the only 
cervid species in which females (Fig. 2A) as well 
as males grow antlers, and in contrast to other 
species of deer, the removal of the gonads of 
either sex soon after birth does not prevent the 
growth and subsequent seasonal replacement of 
the antlers (5). We therefore hypothesized that 
this phenomenon may be related to the increased 
sensitivity of some genes that regulate antler 
growth to low-level androgens. The characteristic 
5'-TGTTCT-3' motif has been identified as the 
androgen receptor binding site in mammals (25). 
We thus examined the promoter regions of 30 
highly expressed genes associated with cervid 
antlers (fig. S4) (26) and identified three 5’- 
TGTTCT-3’ motifs upstream of the antler-specific 
CCNDI gene (Fig. 2B), which regulates cell cycles 
(27) and is required for chondrocyte prolifer- 
ation (28) and thus is closely associated with 
antler growth (29) (Fig. 2B). In particular, we 
noted that reindeer contain a third reindeer- 
specific motif. To validate its role in gene reg- 
ulation, we used chromatin immunoprecipitation 
coupled with quantitative polymerase chain re- 
action (ChIP-qPCR) to determine the binding 
capacity of the androgen receptor to the motifs 
in the blood samples of five female reindeer. Sam- 
ples from three male roe deer served as controls. 
The results show that the androgen receptor 
binds to the second and third motif upstream of 
the antler-specific CCNDI gene in reindeer, but 
only to the second motif in roe deer (Fig. 2C). 
These findings suggest that this extra motif 
might enhance the expression of CCND1 in the 
presence of low androgen levels, thus enabling 
female reindeer to grow antlers. 


Circadian rhythm regulation 
in Arctic light conditions 


Reindeer experience extended daylight fluctua- 
tions and a markedly different seasonality than 
do cervids from temperate, subtropical, or trop- 
ical habitats (2, 3). The effect of the environment 
on reindeer circadian rhythmicity leads these 
animals to lose daily rhythmic activity in winter 
and summer (2) and exhibit an unusual internal 
rhythm of melatonin secretion (2-4). However, 


2 of 6 


PHOTO: YIFENG YANG 


RESEARCH | RESEARCH ARTICLE | RUMINANT GENOMES 


A Female reindeer antler C 


0.047 1 Motif 2 
Hi Motif 3 


Reindeer Roe deer 


@ CCND1 upstream 7202 bp 
(47923470-47923475) 


@ Reindeer TGTTCT | @ Reindeer TGTTCT | @ Reindeer er 
@ Roe deer TGTTCT | @ Roe deer TGTTCT | @ Roe deer TGTYIT 
@ White-lipped deer tTeTtcT | @ White-lipped deer TGTTCT | @ White-lipped deer TGTYIT 
@ Indian muntjac TGTTCT | @ Indian muntjac TGTTCT | @ Indian muntjac TGTTIT 
e Goat GGGTCT | @ Goat TGTTCT | @ Goat TGTYYT 
@ Cattle AGGTCT | @ Cattle TGTTCT | @ Cattle TGTYYT 
@ Sheep GcetcT | @ Sheep TGTTCT | @ Sheep TGTTYT 
e Argali GGGTCT | e Argali TGTTcT | @ Argali TGTTIT 
@ Forest musk deer GGGTCT | @ Forest musk deer TGTTCT | @ Forest musk deer ---44- 
@ Giraffe AGTTCT | @ Giraffe TGTTCT | @ Giraffe TGTYIT 
e@ Pronghorn GGTTTT | @ Pronghorn TGTTCT | @ Pronghorn TGTYIT 
@ Lesser mouse-deer CATTCT | @ Lesser mouse-deer GGTTCC | @ Lesser mouse-deer CGTYYT 
e Killer whale Gcttct | e Killer whale tectct | @ Killer whale TGTTIT 


@® CCND1 upstream 1094 bp 
(47929578-47929583) 


@ CCND1 upstream 867 bp 
(47929810-47929815) 


© TGTTCT maw Promoter GJ) CCND1 coding region (29:47930677-47943220) 
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the genetic mechanism underlying this loss of a 
robust circadian clock remains unresolved. In 
other mammals, light regulates the biological 
clock in the suprachiasmatic nucleus (SCN) of 
the hypothalamus via a photic signal transduc- 
tion pathway (30) by which light-induced neuro- 
transmitters [PACAP (pituitary adenylate cyclase 
activating polypeptide) and glutamate] alter 
Ca** concentrations and trigger the phosphoryl- 
ation of cAMP response element-binding pro- 
teins (CREBs) (Fig. 3A). We therefore retrieved 
165 genes in reindeer that have been identified 
associated with the circadian rhythm pathway 
in the Kyoto Encyclopedia of Genes and Genomes 
(KEGG) (37) and Reactome databases (32) and 
found eight genes that exhibit reindeer-specific 
mutations in functional domains (PER2, NOCT, 
GRIAI, GRIN2B, GRIN2C, ITPR3, ADCY5, and 
NOSIAP) (Fig. 3A and fig. S5). Moreover, among 
four genes (ADCY2, ADCY8, CALMIA, and CAMK2) 
identified as rapidly evolving (table S4), reindeer- 
specific mutations were also observed in ADCY8 
and CALML4 (fig. S5). 

Phosphorylated CREB activates the transcrip- 
tion of period (PERI, PER2, and PER3) and crypto- 
chrome (CRYI and CRY2) genes, which are the 
central elements for maintaining circadian 
rhythms (33). Three reindeer-specific mutations 
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were found in the Per2 gene of reindeer (fig. S5); 
of these, the P1172T mutation was predicted, 
through 3D structure simulations, to affect the 
binding domain between PER2 and CRY (Fig. 3B). 
To test the effect of this P1172T reindeer-specific 
mutation, T1172 was mutated to the corresponding 
wild-type (WT) amino acid (P), and then a coim- 
munoprecipitation (Co-IP) experiment was per- 
formed (/4). Although WT PER2 (P1172, P-type) 
bound to CRYI, the reindeer-mutated PER2 (1172T, 
T-type) could not bind to CRY1 (Fig. 3C). Studies 
have documented that the binding of PER2 and 
CRY1 is required for sustained photic-induced 
circadian rhythms in peripheral tissues and cells, 
whereas CRY2 only modestly weakens rhythms 
(34). Together, these results show that the rein- 
deer P1172T mutation caused the loss of the bind- 
ing ability of PER2 with CRY1, which can lead to 
circadian arrhythmicity in reindeer. 

Among the other seven circadian rhythm genes 
with reindeer-specific mutations, the NOCT gene 
serves as a key posttranscriptional regulator in 
the circadian control of many metabolic pro- 
cesses, including lipid metabolism (35). Notably, 
we observed several reindeer-specific mutations in 
this gene (Fig. 3D). Furthermore, among reindeer- 
specific mutated genes, three genes (GRIAI, GRIN2, 
and ITPR3) affect Ca?* concentrations in neuron 


cells, with another four genes (ADCY5, ADCY8, 
CAMK2, and NOSIAP) affecting the phosphoryl- 
ation of CREB. The GRIAI and GRIN2 genes 
encode glutamate receptors, the activation of 
which is a critical step in the transmission of 
photic information to the SCN (36). The ITPR3 
gene encodes an intracellular calcium channel 
receptor (37). CREB is phosphorylated via ADCY- 
mediated pathways or CAMK2-, CALMI4-, and 
NOSIAP-mediated pathways (38). Together, these 
results indicate not only that several core proteins 
and genes in the circadian clock are altered in 
reindeer but also that changes have occurred in 
their associated upstream and downstream path- 
ways. We therefore hypothesize that these changes 
may be important components in facilitating the 
adaptation of reindeer to the Arctic’s arrhythmic 
light conditions. 


Mutations related to the docility 
of reindeer 


Although most cervids have evolved sensitive 
and vigilant behavior to avoid predators, includ- 
ing humans (39), reindeer are quite docile. Since 
approximately the start of the first millennium 
CE, reindeer have been used by humans for trans- 
port, as decoy animals, and even for milking. This 
characteristic docility potentially explains why 
the reindeer is the most successfully domesti- 
cated cervid (7). It has been hypothesized that 
neural crest cells (NCCs) may play an important 
role in docility and domestication (40). Evidence 
in support of this hypothesis has been found in 
cats (Felis catus) (41) and dogs (Canis lupus 
familiaris) (42). We investigated reindeer-specific 
mutations in genes related to NCCs and found 
reindeer-specific mutations in 11 genes that re- 
late to development of NCCs (MSX2, [D3, BCATI, 
CAD6, and CADII) (Fig. 4 and fig. S6), migration 
of NCCs (TCOF1, BCAT1, NOTCH2, and NOTCH3), 
and differentiation of NCCs (COL2AI, KIT, and 
SD (Fig. 4 and figs. S6 and $7). In addition, GEM 
(43) and ASCLI (44) were identified as rapidly 
evolving genes that have been reported to be 
associated with the development and differenti- 
ation of NCCs, respectively. The COL2A/ gene re- 
lated to the differentiation of NCCs was also 
positively selected in reindeer (Fig. 4). Studies 
have reported mutations in KIT genes, which 
are associated with white spotting on the body 
(40, 45), in domestic cats (41), horses (46), and 
pigs (47). Notably, white-spotted reindeer gener- 
ally exhibit elevated levels of docility and can be 
easily approached by herders even when the rest 
of the herd is in a state of general excitement 
(48). Thus, we speculate that these specific NCC 
gene mutations may be related to the particular 
docility trait of reindeer. 


Discussion 


Our results reveal the genetic basis of a broad 
spectrum of Arctic deer biology and provide a 
foundation for understanding the adaptive strat- 
egies of mammals to the Arctic environment. 
Our comparative genomic studies and functional 
assays identify a number of genes that exhibit 
functionality related to circadian arrhythmicity, 
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vitamin D metabolism, docility, and antler growth; 
are uniquely mutated; and/or are under positive 
selection in reindeer. Furthermore, these studies 
may provide insights relevant to human health, 
such as how the genetic response of vitamin D in 
reindeer affects bone and fat metabolism and 
how the genes related to circadian arrhythmicity 
could enhance our knowledge of seasonal affec- 
tive disorders, insomnia, and depression. 


Materials and methods summary 


We aligned the genome sequences of 51 rumi- 
nant species (13) and 12 outgroup species to the 
goat (C. hircus) chromosomal genome (49) using 
lastal (version 867) (50). We then merged aligned 


sequences using Multiz (version 11.2) (67) and 
identified orthologous genes on the basis of 
synteny with the cattle genome (52), which is so 
far the best-annotated genome in ruminants. 
The whole-genome tree was obtained from our 
ruminant genome project (73), which was gen- 
erated by ExaML (version 3.0.17) (44) and RaxML 
(version 8.2.10) (53). We used the orthologous 
gene sets of 11 species: white lipped deer (Cervus 
albirostris), milu deer (Elaphurus davidianus) 
(54), black muntjac (MVuntiacus crinifrons), rein- 
deer (R. tarandus) (12), white-tailed deer (Odocoileus 
virginianus), roe deer (C. pygargus), cattle (Bos 
taurus) (52), giraffe (Giraffa camelopardalis 
tippelskirchi) (55), pig (S. scrofa domesticus) (15), 
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Fig. 3. Mutations in the reindeer circadian rhythm genes and pathways. Orange genes have 
reindeer-specific mutations; blue genes exhibit increased K/K; values in reindeer. (A) Light 
regulates the molecular clockwork in reindeer SCN neurons. G, G protein; P, phosphorus; nNOS, 
neuronal nitric oxide synthase; ER, endoplasmic reticulum. (B) The reindeer-specific mutation 
(P1172T) of PER2 is located within the binding domain of the PER2-CRY complex, as determined by 
3D modeling. The site of this mutation is marked in red. (C) Co-IP assays show that reverted 
PER2-T1172P (P-type) proteins can bind to CRY1, but reindeer PER2-P1172T (T-type) cannot bind. 
CRY1-Flag was transfected with or without PER2—hemagglutinin (HA) or PER2-T1172P, and Co-IP 
was conducted. IB, immunoblot. (D) The NOCT gene functional domain has specific mutations 

(fig. S5). The reindeer NOCT gene has two specific mutations (S276P and Q313R) in the C-terminal 
deadenylase domain. The amino acid substitution in reindeer is indicated in red. 
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horse (E. caballus) (16), and human (H. sapiens) 
(17), in the following analyses. The Codeml mod- 
ule in the PAML software package (56) was used 
to identify the rapidly evolving genes and posi- 
tively selected genes. KOBAS (version 2.0) was 
used to conduct the KEGG and Gene Ontology 
(GO) enrichment analysis (57). Protein sequences 
of selected genes were compared between rein- 
deer and other animals to identify reindeer- 
specific mutations. Pfam (version 1.6) (58) was 
applied to determine whether the mutations 
were located in the domain region of the pro- 
tein. To validate whether these identified muta- 
tions are specific for reindeer, three domestic 
reindeer samples from the Greater Khingan 
Mountains, Inner Mongolia Autonomous Region, 
China, and three wild reindeer samples from 
the Snghetta mountain area in south-central 
Norway (7) were whole-genome resequenced 
using an Illumina HiSeq 2500 instrument. 
The high-quality reads were aligned to our 
previously released reindeer reference ge- 
nome (12) using BWA-MEM (version 0.7.12) 
(59). SAMtools (version 0.1.19) (60) was used to 
sort and merge the alignment results. Reads 
around indels were realigned by the Genome 
Analysis Toolkit (GATK version 3.5) with the 
Realigner Target Creator and Indel Realigner 
programs (61). The protein 3D structure of in- 
teraction between PER2 and CRY1 was down- 
loaded from the Protein Data Bank (62). The 
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Fig. 4. Genes altered in reindeer play a role 
in neural crest development, migration, and 
differentiation. Red genes are under positive 
selection in reindeer, orange genes have 
reindeer-specific mutations, and blue genes 
exhibit increased K,/K, values in reindeer. Five 
genes (CAD6, BCAT1, ID3, CAD11, and MSX2) 
participating in the development of NCCs, four 
genes (TCOF1, BCAT1, NOTCH2, and NOTCH3) 
involved in the migration of NCCs, and two 
genes (S/ and KIT) related to the differentiation 
of NCCs into melanocytes were found to have 
specific mutations in reindeer. In addition, one 
gene (ASCL1) related to the differentiation of 
NCCs into neurons and one gene (GEM) 
involved in the development of NCCs showed 
increased K,/K, values in reindeer. One gene 
(COLZAI1) related to the differentiation of 
NCCs into chondrocytes was under positive 
selection in reindeer. 
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3D structures of the CYP27B1 and POR proteins 
were predicated using Phyre2 (19) and then vi- 
sualized using UCSF Chimera (63). The CYP27B1 
and POR genes were synthesized and then cloned 
into the vector pET-28a by Gibson assembly 
methods (64). The plasmid was transformed into 
Escherichia coli BL21 to express these two genes. 
The cultured cells were purified and then used to 
measure enzyme activity as described by Akiyoshi- 
Shibata et al. (65). The reindeer PER2 (P1172T), 
reindeer CRYI, and reverted PER2 (T1172P) genes 
were synthesized and then ligated into vector 
pET-28a by Gibson assembly methods (64). Co-IP 
and immunoblotting were performed as described 
previously (66) to test the interaction between 
PER2 and CRY1. ChIP-qPCR was applied to the 
blood samples of five female reindeer and three 
male roe deer to confirm that androgen re- 
ceptor binds to the reindeer motif 3 with the 
reindeer-specific mutation (for details, see the 
full materials and methods in the supplemen- 
tary materials) (14). 
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flexible F,-F, coupling 
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INTRODUCTION: Mitochondrial F,-F, aden- 
osine triphosphate (ATP) synthases are mac- 
romolecular turbines that couple proton 
translocation across a membrane to ATP syn- 
thesis. Protons are translocated through the F, 
subcomplex in the lipid bilayer by a rotor com- 
posed of a defined number of ¢ subunits, each 
with a proton-binding site, to generate ring 
rotation. A central stalk is firmly anchored 
to the c ring and conveys rotary motion to 
the catalytic F, subcomplex in the mitochon- 
drial matrix, where ATP is produced by rotary 
catalysis. A peripheral stalk connects the two 


} ))° 


Dimer formation 


Coordinated metal ion 


subcomplexes to prevent idle rotation of F, 
with F,. 


RATIONALE: Although ATP synthase com- 
plexes have been investigated for more than 
50 years, several key questions remain. An en- 
during question is how the stoichiometrically 
mismatched c ring in F, (composed of 8 to 17.¢ 
subunits) and the three-fold symmetric F, head 
are efficiently coupled. Another open question 
is the exact pathway taken by protons through 
the membrane, which has been the least well 
characterized part of the mechanism. 


Rotary substates 


Cryo-EM structure of the Polytomella ATP synthase dimer. The F; head (green) is linked 
to the c-ring rotor (yellow) by the central stalk and the peripheral stalk. Insets (beginning 
at top right) show the flexible OSCP hinge (orange); F; rotary substates with subunits 

B (green), y (blue), and c (yellow); a coordinated metal ion in the proton access channel 
(light blue); and the dimer-forming subunit ASA10 (red). 
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RESULTS: We used single-particle cryo-electron 
microscopy (cryo-EM) to characterize the struc- 
ture and dynamics of a complete and active 
dimeric mitochondrial ATP synthase from the 
chlorophyll-less unicellular alga Polytomella 
sp. Together with data obtained by genome 
sequencing and mass spectrometry, our 2.7- to 
2.8-A resolution map allowed us to build a 
full atomic model of the 1.6-MDa complex. The 
model includes the newly identified subunit 

ASAIO, which interlinks the 
two ATP synthase mono- 
Read the full article Merson the lumenal side of 
at http://dx.doi. the membrane. Separation 
org/10.1126/ of 13 independent rotary 
science.aaw9128 states provides a detailed 
adic steatcadates ead fini wie ceeetououite 
the movements that accompany c-ring rotation. 
We find that the F, head rotates together with 
the central stalk and c ring through approx- 
imately 30S or one ¢c subunit, at the beginning 
of each 120° step. Flexible coupling of the F, 
head to the F, motor is mediated primarily by 
a hinge at the interdomain link of the oligomy- 
cin sensitivity-conferring protein (OSCP) sub- 
unit that joins the F, head to the peripheral 
stalk. The extended two-helix bundle of the 
central stalk y subunit interacts with the catch- 
loop region of one 8 subunit of the F, head. 
The resulting mechanism of flexible cou- 
pling is likely to be conserved in other F,-F, 
ATP synthases. Our results provide much-needed 
context to a wealth of published data indicat- 
ing that OSCP is a hub of metabolic control 
in the cell. 

Our high-resolution map of the proton- 
translocating F, complex has revealed a strong 
density, very likely a metal ion, ligated by two 
histidine residues. Recent cryo-EM studies of 
yeast and spinach chloroplast ATP synthase 
contain unannotated densities at the same po- 
sition. Mutational experiments in Escherichia 
coli have shown that an equivalent residue is 
essential to proton translocation. By three- 
dimensional classification, we separated two 
different rotational positions of the c ring and 
showed that the coordination environment of 
the metal ion changes with c-ring position. This 
evidence points toward a role for the metal ion in 
synchronizing c-ring protonation with its rotation. 


CONCLUSION: In ATP synthases, the F, cat- 
alytic head can accompany the rotor through a 
rotation of ~30° at the beginning of each ~120° 
step. This movement allows flexible coupling of 
F, and F,. The interdomain hinge of OSCP facil- 
itates flexible coupling and makes this subunit 
an apposite point for the regulation of ATP 
synthesis. 


The list of author affiliations is available in the full article online. 
*These authors contributed equally to this work. 
{Corresponding author. Email: werner.kuehlbrandt@biophys. 
mpg.de 
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F,F,—adenosine triphosphate (ATP) synthases make the energy of the proton-motive force 
available for energy-consuming processes in the cell. We determined the single-particle 
cryo-electron microscopy structure of active dimeric ATP synthase from mitochondria 

of Polytomella sp. at a resolution of 2.7 to 2.8 angstroms. Separation of 13 well-defined rotary 
substates by three-dimensional classification provides a detailed picture of the molecular 
motions that accompany c-ring rotation and result in ATP synthesis. Crucially, the F; head 
rotates along with the central stalk and c-ring rotor for the first ~30° of each 120° primary 
rotary step to facilitate flexible coupling of the stoichiometrically mismatched F, and F, 
subcomplexes. Flexibility is mediated primarily by the interdomain hinge of the conserved 
OSCP subunit. A conserved metal ion in the proton access channel may synchronize c-ring 


protonation with rotation. 


itochondria carry out controlled oxida- 

tion of reduced substrates to generate an 

electrochemical gradient across the in- 

ner mitochondrial membrane. Movement 

of protons down the gradient through 
the F|F,-ATP synthase drives the production of 
the soluble energy carrier ATP by rotary cataly- 
sis. The ATP synthases consist of two connected 
nanomotors: (i) the membrane-embedded F, sub- 
complex, where proton translocation generates 
torque, and (ii) the soluble F; subcomplex, where 
the torque powers adenosine diphosphate (ADP) 
phosphorylation [see (7) for a recent review]. In 
F,, a ring composed of 8 to 17 ¢ subunits, each 
containing a carboxylate ion-binding site, rotates 
against a bearing formed by a near-horizontal 
helix hairpin of the stationary a subunit (2). Two 
aqueous half-channels at the interface of the c 
ring and a subunit allow protons to enter and 
leave the protonation site. Upon protonation 
from the lumenal channel, a c-ring subunit travels 
by an almost full rotation through the hydro- 
phobic membrane environment before releasing 
the proton in the matrix channel. In the mem- 
brane, the channels are separated by a mere 6 A 
(3), with a strictly conserved arginine residue 
facilitating the passage of deprotonated, but not 
protonated, c-ring subunits. The energy from 
down-gradient proton translocation powers rota- 
tion of the c ring and the firmly attached central 
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stalk (subunits y, 5, €), which conveys rotation 
generated in F, to power ATP synthesis. The two- 
helix bundle of the y subunit extends deep into 
the a3f3 hexamer of the F, head. Rotation of the 
central stalk induces conformational changes of 
subunits o and £ that result in ADP phosphoryl- 
ation. The two-domain oligomycin sensitivity- 
conferring protein, OSCP, connects the F, head 
to a membrane-anchored peripheral stalk stator, 
preventing unproductive rotation of F,. 

The dimeric ATP synthase from mitochondria 
of the chlorophyll-less unicellular alga Polytomella 
sp. contains the signature subunits of mitochon- 
drial ATP synthases, a, Bs, y, 5, €, Cio, a, and 
OSCP. The peripheral stalk and other membrane 
protein subunits typical of mammalian and fungal 
ATP synthases (7) are replaced in Polytomella 
and related species, including the photosynthetic 
model organism Chlamydomonas reinhardtii (4), 
by proteins known as ATP synthase-associated 
(ASA) proteins, which bear no homology to other 
known ATP synthase components (4, 5). These 
subunits form a bulky peripheral stalk that 
links the two ATP synthase complexes into a 
stable dimer. 

ATP synthase must couple translocation of 8 
to 17 protons (1), depending on c-ring size, with 
phosphorylation of three molecules of ADP in 
the catalytic B-subunit sites of the F, head. The 
stoichiometric mismatch of F, and F, poses a 
challenge to efficient energy conversion in the 
complex (6, 7). It has been proposed that the 
central stalk mediates flexible coupling between 
F, and F, (8, 9). Some authors suggest that en- 
ergy is stored by coiling of the y-subunit two- 
helix bundle (10); others propose that energy is 
stored in the globular portions of subunits y and 
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€, where they are attached to the c ring (9). Re- 
cent cryo-electron microscopy (cryo-EM) studies 
resolving rotary states of mitochondrial (1D), bac- 
terial (12), and chloroplast (13) ATP synthase have 
all found that the central stalk rotates as a rigid 
body. Substantial flexibility in the peripheral stalk 
is observed between rotary states in bacterial 
(12, 14) and chloroplast ATP synthase (12), which 
suggests that peripheral stalk bending contrib- 
utes to elastic energy storage in these systems, 
whereas the peripheral stalk of mitochondrial 
ATP synthase dimers is bulkier and appears to 
be less flexible. The robust, rigid peripheral stalks 
of the Polytomella ATP synthase make it an ideal 
system for examining the dynamics revealed by 
rotary states and substates by high-resolution 
cryo-EM. 


High-resolution map of a complete 
F-type ATP synthase dimer 


Cryo-EM single-particle analysis (table S1) of purified 
and active (2) (fig. S1) Polytomella ATP synthase 
dimer yielded a map at 2.94 A overall resolution. 
Local masking improved the resolution to 2.7 to 
2.8 A in overlapping regions covering the full 
complex (table S2 and figs. S2 and S3). Sym- 
metry expansion allowed each monomer to be 
treated independently in image analysis, and 
three-dimensional (3D) classification of the 
central stalk and F, head enabled separation of 
three primary rotary states, at resolutions of 2.8 
to 2.9 A, in the F, head and rotor (Fig. 1 and 
Movie 1). The nucleotide-binding sites Brp and 
Bpp contain Mg”* ADP, and Bx is empty (Fig. 1, B 
to D). The ADP-bound complex was expected as 
the protein was prepared without added nucleo- 
tides. The “arginine finger” (15) of aArg*”? extends 
toward the nucleotide phosphate in Bpp but away 
from it in Brp (Fig. 1, C and D). The cryo-EM data 
were complemented by genomic sequencing and 
mass spectrometry analysis of the purified pro- 
tein, enabling us to build a full atomic model of 
the 1.58-MDa complex, with 62 copies of 18 dif- 
ferent subunits and a total of 14,240 residues 
fitted (Fig. 1, fig. S4, and table S3). The model 
includes the completed polypeptide sequence 
of subunits ASA2 and « and the previously un- 
known subunit ASA10. 


Movie 1. Three-dimensional map of the 
Polytomella ATP synthase dimer. One 
monomer is gray; the other monomer map is 
colored by subunit, as in Fig. 1. 
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Rotary substates reveal concerted 
rotation of F; and central stalk 

To describe the conformational space of the 
F,F,-ATP synthases, we sorted the symmetry- 
expanded dataset into 12 classes. Further sub- 


Fig. 1. High-resolution 
structure of the 
mitochondrial F,F, ATP. A 
synthase dimer from 
Polytomella sp. 

(A) Composite cryo-EM 
map of the 62-subunit, 
1.58-MDa Polytomella 
dimer; the right side 

is colored by subunit. 
The Cjo rotor ring and 
subunit a make up the 
F, motor complex. 
Proton translocation 
through F, causes 
rotation of the c ring 
and the attached central 
stalk subunits y, 6, 

and e. The F; head 
consists of three catalytic 
6 subunits (light green) 
and three a subunits 
(dark green). Long 
C-terminal extensions 
of B wrap around the a 
subunits on the outside 
of F; (see Movie 5). 

The two-domain OSCP. 
subunit links the three 

a subunits to the 
peripheral stalk subunits 
ASAI1 to ASA10 (see 

Fig. 4). Subunit ASA10 
connects the two FiF, 
monomers in the mem- 
brane (see figs. S9 

and S11). The local map 
resolution is 2.7 A for 
the peripheral stalk, Fo, 
and c ring, and 2.8 to 
2.9 A for the F; head 
and central stalk (see 

fig. S2). Inset: Three 
primary rotary states 

(1, 2, and 3) of the FiF, 
monomer are related by 
~120° rotation of the 
central stalk within the 
a383 assembly. Unless 
otherwise specified, sub- 
unit coloring is consistent 
throughout all figures. 
(B) Section through F, at 
the level of bound nucleo- 
tides. Bpp and Brp sites 


contain ADP (red) and Be is empty. The three a subunits bind a structural ATP (orange). Nucleotide- 
g** ions are violet. The central stalk subunit y (blue) engages with the catch loop 


coordinating 


division of a mixed class yielded a total of 
13 distinct rotary substates (fig. S2), which were 
refined to 2.8- to 4.2-A resolution (fig. S3 and 
table S2). The subclasses can be placed into a 
meaningful sequence according to the rotation- 


of the Be subunit (see fig. S7). (© and D) Catalytic sites of subunits Bpp and Brp. A well-defined 
Arg side chain of subunit a (the “arginine finger”) extends toward the nucleotide phosphate in the Bpp site. ATP in the Brp site has hydrolyzed to 


ADP during protein isolation. Amino acid abbreviations: D, Asp; E, Glu; K, Lys; N, Asn; R, Arg; T, Thr. 
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al position of the rotor with respect to the stator 
(fig. S5). Each subclass was assigned to one of 
three primary rotary states, with six subclasses 
for state 1 (1A to 1F) (Fig. 2 and figs. S5 and S6), 
four for state 2 (2A to 2D), and three for state 3 
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(3A to 3C). The three primary states are sepa- 
rated from one another by ~120° rotation of the 
central stalk within the F, head (Fig. 1A, inset). 
In contrast, substates of any given rotary state 
differ by concerted rotation of the F, head and 
rotor within the first 15° to 32° of each ~120° 
step (Fig. 3, Movies 2 and 3, and fig. S5). Con- 
tact between the ; site and the y subunit [the 
“B-catch loop” (J6)] is maintained between sub- 
states (Movie 3 and fig. S7), as is the nucleotide 
occupancy of 8 subunits (Fig. 3B and fig. S5). 
The pivot point of this movement is the single- 
peptide chain connecting the two OSCP do- 
mains (henceforth the “OSCP hinge”) (Fig. 4 


Fig. 2. Rotary substates of mitochondrial ATP synthase. The three primary rotary 
states are subdivided into a total of 13 rotary substates at resolutions between 2.8 and 4.2 A 


and Movie 4). The top of the peripheral stalk 
flexes only slightly. Previous studies have sug- 
gested that the interdomain region of OSCP 
may be flexible (77-19). In comparing all 13 sub- 
states, no coiling of the central stalk is apparent 
(fig. S6). 

In the first ~30° of rotary steps 1 and 2 (in 
moving from state 1A to IF and state 2A to 2D), 
the c ring advances by almost one subunit with 
respect to the a subunit (Fig. 3A) while the 
position of y with respect to the o383 hexamer 
remains virtually unchanged (Fig. 3B and fig. 
S5). Between substates of state 3 (in moving 
from state 3A to 3C), the ring advances by about 


(see figs. S2 and S3 and table S2). The number of resolved substates and the angular 
increments between them differ for each of the three primary rotary states (fig. S5). 

(A) Composite cryo-EM map of rotary substate 1A at 2.9-A resolution. (B to D) Overlays of 
substate maps indicate concerted movement of the N-terminal domain of OSCP (B), the 
F, head with the central y subunit (C), and the c ring (D) from substate 1A to substate 1F. 
Projected map densities of other subunits are shown in light gray for reference. 
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half ac subunit together with the rotor (Fig. 3A). 
As the rotor moves a further ~90° to the starting 
position of the next primary rotary state (e.g., 
from state 1F to 2A), the F, head recoils by ~30° 
to its original position, for a cumulative ~120° 
power stroke of y within F, for this step. Given 
that this motion is thermally accessible for the 
complex at equilibrium, it would almost cer- 
tainly contribute to flexible coupling of F, and 
F, also under turnover conditions. 

We observe an uneven distribution of particles 
between the three major rotary states, with 52% 
in state 1, 23% in state 2, and 21% in state 3, in 
line with other recent cryo-EM studies of ATP 


C1) ring 
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synthases (11-13) and with a recent single- 
molecule study (20) that reported an asymmetry 
in the time-averaged population of rotary states 
in the presence of ADP. In comparing substates 
of a given primary rotary state, the later sub- 
states of the Polytomella complex (1D, 1E, 1F, 2C, 
2D, and 3C) are consistently less populated 
than the earlier substates (fig. S2 and table 
82), which suggests that these are higher- 
energy states. 

The interaction between F, and y appears 
strongest in the catch loop region (16) of Bx, 
where conserved residues BAsp**”, BAsp**®, and 
BThr?*” (BAsp?°*, BAsp?°°, and BThr?°* in 
Escherichia coli) form ionic and hydrogen-bond 
interactions with yArg*°° and yGin?®” (E. coli 
yArg?° and yGin?®). The salt bridge between 
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BAsp*** and yArg?°° forms via a resolved water 
molecule (fig. S7). Throughout the 20° to 30° 
concerted rotation of F, with y, the interaction 
between the B; catch loop and y is maintained, 
although subtle loop movements suggest that 
the hydrogen-bonding partner of BAsp*? changes 
from yGln?%” to yAsn%* between substates (e.g., 
from state 1A to 1F; fig. S7). Previous disruption 
of these interactions by mutation in E. coli 
eliminated or strongly reduced ATP hydrolysis 
activity and the ability to grow on succinate (J6). 
A proposed function of such an interaction is 
that, in ATP synthesis mode, it would stall rota- 
tion of y until the By site has bound ADP, to 
prevent unproductive rotation (16). The ma- 
jor impact of catch loop mutations on both 
synthesis and hydrolysis activity implies that 


STATE 2 
120° 


this region plays a broader role in the con- 
formational changes required for catalysis. On 
the basis of our results, we suggest that this 
interaction does not stall rotation completely; 
rather, a proton may be translocated at F, while 
the F, head waits for substrate binding. 

Studies aiming to characterize flexible coupling 
have focused on the central stalk (27), often 
working with an F, and rotor subcomplex. When 
the whole F,F, complex was examined, it was 
typically attached to substrate and reporter mole- 
cules at subunits B and c to probe for flexibility 
along the F-F, axis (9, 22, 23). A recent single- 
molecule study of F,F, reported forward and 
backward stepping of the rotor by ~30° (i.e., up 
to one c subunit) (22). However, with F, anchored 
to the support by subunit B and a probe attached 


STATE 3 


120° 


Fig. 3. Concerted rotation of F, with central stalk and c ring. 

(A) Progressing in the direction of ATP synthesis, the C9 ring (yellow) rotates 
counterclockwise (as seen from F,) with respect to the a subunit (light 
blue) by up to 32° for substates of the same primary rotary state. The 
primary rotary states differ by power strokes of ~120° The position of 
one c subunit in the first (black outline) and last rotary substate (pink) of 
each primary state is indicated. (B) Between substates of a given rotary 
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state, the F, head rotates together with the c ring and central stalk 
before recoiling to its original position in the first substate of the 
subsequent primary rotary state. Subunits a (dark green) and B (light 
green) with their bound nucleotides (red and orange) are shown for 

the first substate in each primary state. The position of the last substate 
in each primary state is indicated in light blue in the background. The 
peripheral stalk position is shown in gray. 
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to c, flexibility at OSCP would not be detected. 
The observed ~30° stepping angle appears to re- 
flect regular energy minima in the c-ring/a- 
subunit interaction rather than joint rotation 
of the c ring and central stalk with F,. Single- 
molecule studies of flexible coupling may need 
to explore a wider range of anchoring points. 
Several lines of evidence support the idea that 
OSCP-mediated flexible coupling plays a role in 
other F-type ATP synthases: (i) The structure 
and polypeptide sequences of B, y, and OSCP sub- 
units involved in this process are highly con- 


served. (ii) An OSCP subunit (or 6 in chloroplasts 
and bacteria) is found in all three lineages of ATP 
synthase known to have independently acquired 
novel peripheral stalk components (24, 25). (iii) A 
cryo-EM study of bovine ATP synthase (11) iden- 
tified rotary substates of this complex, albeit at 
lower resolution, indicating rotation of F, rela- 
tive to the remainder of the complex. Reeval- 
uating these results in light of our data, they do 
in fact show concerted rotation of F, with the 
central stalk. (iv) Well-known inhibitors and 
regulators of ATP synthase bind to OSCP, affect- 


ing the catalytic rate and Michaelis constant (Kj,) 
(26, 27), suggesting a mechanistic rather than 
merely structural role for this subunit. The sim- 
pler bacterial and chloroplast ATP synthases have 
a thinner, flexible peripheral stalk relative to 
Polytomella, and previous cryo-EM studies have 
found substantial flexibility in the peripheral 
stalk even between primary rotary states (12-14). 
In these simpler systems, two or more compo- 
nents may mediate flexible F,-F, coupling. 

A body of literature suggests that OSCP- 
mediated flexible coupling (Fig. 4 and Movie 4) 


STATE 1 


Fig. 4. Subunit OSCP connects the F, head and peripheral stalk as a 
flexible hinge. (A) Overview of FiF, monomer indicating the position of 
OSCP (orange) on top of the F; head (green) relative to the peripheral stalk 
(gray). (B) The two-domain OSCP interacts with the helical C-terminal 
extensions of subunits al, a2, and a3 (green) (see Movie 4). Extensions 
of a2 and a3 form short helix bundles with helices of the globular 
N-terminal OSCP domain, whereas al binds to the elongated C-terminal 
OSCP domain that is attached to the peripheral stalk via the a1 extension. 
The two OSCP domains are connected by a flexible peptide link that 
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STATE 3 


facilitates ~30° back-and-forth rotation of the N-terminal domain with 

F, (curved black arrow) relative to the C-terminal OSCP domain and 
peripheral stalk. (C) Between substates of a given primary rotary state, 
the N-terminal OSCP domain rotates with F, by up to 28° (straight dashed 
lines) in synthesis direction (orange arrows) and then recoils to its 
starting position in the first substate of the next primary rotary state 
(red arrows). The OSCP position in the first substate of each primary state 
is shown in orange; the OSCP position in the last resolved rotary state 

in each primary state is indicated in light blue in the background. 
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eP1 eP2 


H248 H252 


Fig. 5. Metal ion and ordered water molecules in the F, proton access 
and release channels. (A) The H5/H6 helix hairpin of subunit a (light blue) 
forms the bearing against which the cio ring (yellow) rotates, shown here 
for c-ring position 1. (B) Sectional view of H5 and outer-ring c-subunit 
helices with proton-binding cGlu™, c-ring position 1. (C and D) Water 
molecules (purple density) coordinated by the ion-binding cGlu” and 
adjacent cSer™ in the matrix channel for c-ring positions 1 (C) and 

2 (D). Maps are displayed at density thresholds of 0.035 for subunit a, 
0.025 for the c ring, and 0.025 (C) or 0.018 (D) for water. (E) The strictly 
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conserved aArg?2 and aGIn@°° that separate the proton access and release 
channels in the membrane bind two water molecules, shown in the consensus 
C2-refined map of the F, region. (F to H) Conserved residues aHis*“° and 
aHis?°? in H5 of subunit a coordinate a metal ion (green). Bond distances 
change depending on c-ring position. The predominant c-ring position 
[position 1, full color, and (H)] accounts for 58% of particles; a second position, 
differing by rotation of roughly 13° [position 2, faint yellow, and (G)] accounts 
for 33% of particles. Except where specified, all maps in a given panel are 
rendered at the same density threshold. E, Glu; H, His; Q, Gln; R, Arg; S, Ser. 
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has broad physiological implications, with a 
number of regulatory mechanisms acting on 
OSCP. Cyclophilin D, a regulator of mitochon- 
drial permeability transition, binds to OSCP 
and inhibits ATP synthase activity (28), as does 
the immunomodulator Bz-423, which affects both 
maximum rate (Vinax) and Ky, for ATP hydrolysis 
(27). Deacetylation of a conserved Lys near the 
hinge region of OSCP in response to exercise 
stress affects mitochondrial ATP levels (29). The 
hormone estrogen binds to OSCP (30) and inhib- 
its ATP synthase activity (37, 32) with possible 
relevance to neuroprotection (33). The tumor 
suppressor p53, under nonstress conditions, binds 
to OSCP and promotes increased O, consumption 
and decreased ROS levels in mitochondria (34). 
These diverse and potent modulations of ATP 
synthase activity cannot be reconciled with a 
model of ATP synthase in which OSCP forms a 
static bridge between the F, head and peripheral 
stalk. Our results show that, on the contrary, 
OSCP plays a dynamic role in the flexible cou- 
pling of F, and F,;. The concept of a dynamic 
OSCP will be essential to understand and exploit 
modulation of ATP synthase activity, with rele- 
vance to neurodegeneration and traumatic brain 
injury, ischemia-reperfusion injury, and immu- 
nomodulation, among others. 


An essential histidine ligates a metal ion 
that is sensitive to c-ring rotation 


In respiring mitochondria, the pH of the lumen 
is 7.2 (35), while the pK, of the c-ring carboxylate 
in an aqueous environment is around 4.5 (36); 
thus, protonation of the c ring needs to be as- 
sisted by the local environment in the lumenal 
entrance channel in order to achieve high turn- 
over rates. Mutational studies in E. coli (37) have 
established that residues aGlu””’ and aHis”” are 
essential for both ATP synthesis and proton per- 
meability of F, in membranes stripped of F,. The 
mutations @E219H and aH245E show low levels 
of activity, while the double mutant is more ac- 
tive than either single mutant, suggesting that 
the residues interact to facilitate c-ring proton- 
ation. Our map shows a strong, nonpeptide 
density ligated by aHis”** (aGlu”” in E. coli) and 
aHis”? of aH5 (Fig. 5) with bond lengths of 2.0 
to 2.2 A. The strong density and coordination 
environment favor assignment to a metal ion, 
provisionally annotated as Zn** in the deposited 
models. At high threshold levels, the bound 
species appears diatomic, perhaps because of a 
heavy ligand (Fig. 5, B and F to H). Both ligating 
His residues are present in mammalian ATP syn- 
thases (fig. S8). The aHis”** position is occu- 
pied by His or Glu in all F-type ATP synthases, 
with the notable exception of Na*-translocating 
complexes (fig. S8A). Recently published high- 
resolution cryo-EM maps of yeast mitochondrial 
(38) and spinach chloroplast ATP synthase (13) 
both indicate a strong, unattributed nonpeptide 
density at the same position in the lumenal 
channel, near the residue equivalent to aHis*® 
(aHis'® and aGlu’®®, respectively) (fig. $8, C 
and D). In both instances, a single strong den- 
sity was observed as compared to the two den- 
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sities in our map, although this may reflect the 
lower local resolution (3.6 A and 3.4 A) of the 
yeast and spinach maps. 

Masked 3D classification of the c ring and a 
subunit yields two distinct c-ring rotary posi- 
tions at 2.7-A and 3.1-A resolution, represent- 
ing 58% and 33% of particles (fig. S2). The 
positions differ by rotation of the ¢ ring by ap- 
proximately one-third of a c-ring subunit (13°) 
(Fig. 5F). Comparing these maps, it is clear that 


2, es 
C2, Fes 
“Gy EI 


Movie 2. Morph of 13 rotary substates of 
the Polytomella ATP synthase monomer. 
The c ring (yellow) and central stalk (Subunits: 
y, blue; 5, cyan; e, pale blue) together rotate 

in three roughly equal ~120° steps. The F; head 
moves with them for the first ~30° of each 
step. a, dark green; B, bright green. The 
two-domain OSCP subunit (orange) works 

as a hinge between the moving F; head and 
the stationary peripheral stalk (gray). 


Movie 3. Section through movie 2 at the 
level of nucleotide-binding sites in the 
F, head. Catalytic sites of the B subunits 
(bright green) are red. Binding sites for 
the structural ATP in the three a subunits 
(dark green) are orange. The B catch loop 
is drawn in purple. Central stalk subunit 

y, blue; peripheral stalk, gray. 
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the configuration of the metal ion in the lume- 
nal channel changes with rotation of the ¢ ring, 
with the distance between the nonpeptide den- 
sities changing from 2.2 to 3.7 A between c-ring 
positions 1 and 2 (Fig. 5, F to H). The coordi- 
nation of the metal is sensitive to c-ring position, 
suggesting that it may play a role in synchroniz- 
ing c-ring protonation with its rotation. In both 
positions, distances between the HiseN and metal 
ion are in the 2.0 to 2.2 A range, consistent with 
dative bonds and thus with unprotonated HiseN. 
Available mutational data strongly suggest that 
the role of aHis*** is to protonate aGlu***, which 
would require that aHis** is itself transiently 
protonated. Protonation of either His or Glu 
at this position would preclude their ability to 
coordinate a metal ion. Further work will be 
needed to clarify the identity, dynamics, and 
function of this metal ion. 


Ordered water in the aqueous 
half-channels and at the essential Arg7*” 


The membrane-embedded a subunit and c ring 
together define the pathway for protons to move 
across the inner mitochondrial membrane (Fig. 
5A). The sequences of both subunits are con- 
served in all F-type ATP synthases, including 
those of bacteria and chloroplasts (7). The sim- 
ilarity (identity) scores of human and Polytomella 
mitochondrial ATP synthase are 44.9% (27.6%) 
for subunit a and 57.0% (38.7%) for subunit 
c. Previously, we showed (3) that the aqueous 
half-channels that conduct protons to, and away 
from, the c-ring glutamate are separated by a 
distance of 5 to 7 A in the center of the mem- 
brane. We were able to model ordered water 
molecules within both channels in the current, 
high-resolution map (Fig. 5, C to E). In the matrix 
channel, a water molecule is coordinated by 
cSer'” opposite the ion-binding cGlu™ of the 
adjacent c subunit, favoring the idea that the 
c-ring glutamate may be directly deprotonated 
by water in this location. Two water molecules 
are enclosed by the strictly conserved aArg?”® 
and aGln®®’ in a pocket that appears not to be 
continuous with the aqueous channels (Fig. 5E). 
The interaction of these conserved residues with 
each other via a water molecule would reduce 
the flexibility of the aArg”®® side chain. In the 
lumenal proton access channel, the well-ordered 
water lies between the assigned metal ion den- 
sity and the strictly conserved aArg”’®. Exam- 
ining these waters, it is clear that the channel 
extends to the a/c interface close to aAre””’, as 
previously reported (3). However, it would ap- 
pear that these water molecules do not directly 
protonate the c-ring glutamate, because muta- 
tion of aHis*“ and aGlu?"’ in E. coli eliminates 
proton permeability of the F, complex in stripped 
membranes (37). Instead, the c ring is likely 
protonated as it rotates past aGlu*** (aHis”** in 
E. coli), immediately before passing into the hy- 
drophobic lipid bilayer. Water in the lumenal 
channel permeating all the way to aArg””? would 
provide an aqueous environment to prime the 
c-ring glutamate for protonation by inducing an 
outward-facing conformation (36). At the same 
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time, the small distance between half-channels 
would generate a substantial field and result- 
ing torque on the c-ring glutamate, which may 
be needed to overcome the energetic barrier of 
moving the c-ring glutamate past the gating 
Arg”? (7, 3). 


C-terminal fp extensions contact the 
adjacent o-subunit 


In chlorophycean algae, the mitochondrial cata- 
lytic 8 subunit resembles those of the canonical 
ATP synthases closely, except that the chloro- 
phycean subunit has acquired a ~60-residue 
C-terminal extension. Our map shows that this 
extension wraps vertically around the adjacent 
a subunit (Movie 5) and moves with subunit B 
as it adopts an open or closed conformation. A 
~15-residue N-terminal extension of the o sub- 
units present in chlorophycean algae, but not 
in other systems, is in a different configuration 
for each subunit (Fig. 1B, Fig. 4A, and Movie 5). 
One of these extensions, together with the distal 
OSCP domain, anchors one a subunit to the periph- 
eral stalk, while the other two form short helix 
bundles with the proximal globular domain of 
OSCP, attaching it to F,. The N- and C-terminal 
extensions would enhance the stability of the 
chlorophycean mitochondrial ATP synthase, which 
may be an advantage under stress conditions, 
as when Polytomella converts to a dormant cyst 
in nutrient-depleted environments (39). 


ATP synthase-—associated proteins 1 to 
10 form a rigid peripheral stalk 


In Polytomella, dimer formation is mediated by 
the peripheral stalk consisting of 10 ASA sub- 
units that have no homologs in other known ATP 
synthases outside this class of green algae. We 
have modeled all nine previously described ASA 
subunits unambiguously into the 2.7 A map 
(Fig. 1, Movie 1, and fig. S4), as well as an un- 
known subunit that we term ASA10. Mass spec- 
trometry of the purified complex and DNA 
sequencing of the ~50-Mb Polytomella genome 
allowed us to identify ASA10 (fig. S9) and to ob- 
tain complete sequences for subunits e and ASA2 
(fig. S10). All subunits were built into the cryo- 
EM map de novo, as there are no homologous 
structures in the database. 

The stability of the Polytomella dimer is due to 
the rigid peripheral stalks and their close inter- 
action. ASAI is at the core of the soluble stalk 
region, and forms a bridge between monomers 
with a helix-turn-helix motif 75 A above the mem- 
brane surface (fig. S11). F, has six transmembrane 
helices and four long, membrane-intrinsic helices, 
which form the two helix hairpins of the con- 
served a subunit (2). Of the six transmembrane 
helices, two belong to ASA6 and one each to 
ASA5, ASA8, ASAQ, and ASA10. The two ASA10 
subunits form the primary dimer interface on 
the lumenal side of the membrane, and their C 
termini are intertwined (figs. S9 and S11). Little 
or no other direct interaction is seen within the 
membrane, with lipids or detergent filling the 
space between monomers. ASA3 forms the char- 
acteristic Armadillo repeat domain above the 
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membrane surface on the matrix side, and ASA2, 
ASA4, and ASA7 sit atop the peripheral stalk. 


Outlook 


We have determined the structures of Polytomella 
ATP synthase in 13 rotary substates, allowing 
us to visualize most if not all of its thermally 
accessible conformations. Critically, we have 
shown that the F, head and rotor move together 
through a 20° to 30° rotation; thus, the ¢ ring 
rotates relative to the a subunit while the posi- 
tion of the central stalk within the F, head is 
preserved (fig. S12). This movement ensures flex- 
ible coupling of the two symmetry-mismatched 
nanomotors of ATP synthase, F, and F,, for all 
possible c-ring stoichiometries. Flexible coupling 
appears to be mediated predominantly by OSCP, 
which is emerging as an important target of 
cellular and pharmacological control. Whereas 
most previous structural studies of ATP synthase 
have characterized inhibited complexes, this work 
has instead examined the active ADP-bound 
form. The conformational flexibility we observe 
in these structures may have been suppressed 
in the previously determined auto-inhibited and 
inhibitor-bound states. Further studies will ad- 
dress the energetics of OSCP bending and inte- 
grate these results into a full catalytic scheme of 
ATP synthesis. A metal ion bound at the con- 
served residue aHis”** within the lumenal chan- 
nel is sensitive to the c-ring rotary state and is 
likely to play an important role in protonating 
the c-ring carboxylate. A major unanswered ques- 
tion is how the strictly conserved aArg”®® fa- 
cilitates movement of deprotonated, but not 
protonated, c-ring subunits between the aque- 
ous half-channels. The high-resolution detail 
provided by our model will be important for 
answering this question. 


Materials and methods 
Protein isolation and purification 


ATP synthase from Polytomella sp. was purified 
as described (2) with slight modifications. Mito- 
chondrial membranes (175 mg) were solubilized 
for 30 min at 4°C in a volume of 12 ml of buffer 
containing 30 mM Tris-HCl, pH 7.8, 2 mM MgCl, 
50 mM NaCl and 2.9% (w/v) n-dodecyl-B-p- 
maltoside (DDM) to a final detergent:protein 
weight ratio of 2:1. After centrifugation (20,000g, 
15 min, 4°C) to remove unsolubilized material, 
the filtered supernatant was loaded onto a POROS 
GoPure HQ column (Thermo Fisher Scientific) 
equilibrated in buffer A [30 mM Tris-HCl, pH 7.8, 
2 mM MgCly, 50 mM NaCl and 0.015% (w/v) 
DDM] on an Akta purifier (GE Healthcare). 
After washing the column with 100mM NaCl in 
buffer A, ATP synthase dimers were eluted with a 
linear 100 mM to 300 mM NaC! gradient in buf- 
fer A. For the final purification step, fractions 
containing ATP synthase dimers were concen- 
trated in Vivaspin 500 columns with 100,000 
molecular mass cutoff and then loaded onto a 
Superose6 Increase 3.2/30 size exclusion column 
(GE Healthcare) equilibrated in buffer B [30 mM 
Tris-HCl, pH 7.8, 2 mM MeCl,, 40 mM NaCl, 0.05% 
(w/v) DDM] on an Ettan purifier (GE Healthcare). 
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Movie 4. Hinge movement of the two-domain 
OSCP subunit (orange). The proximal a-helical 
OSCP domain at the right is attached to the F, 
head by the N-terminal extensions of two 

of the three a subunits (dark green). The distal 
B-sheet OSCP domain is attached to the 
peripheral stalk (gray) by interaction of one 
OSCP helix and the N-terminal extension of one 
of the three a subunits. 


Fractions of pure and active (2) ATP synthase 
dimers were collected on ice and used directly 
for cryo-EM specimen preparation. Purity and 
activity of the sample was analyzed by blue- 
native polyacrylamide gel electrophoresis (BN- 
PAGE) and in-gel ATP hydrolysis assay (40). For 
two-dimensional polyacrylamide gel electropho- 
resis (2D SDS-PAGE) (41), subunits of the ATP 
synthase dimer were first resolved on a 10% 
acrylamide/8 M urea gel before being separated 
in the second dimension in a 16% acrylamide gel. 


Electron microscopy and 

image processing 

A solution of 3-4 mg/ml purified complex was 
applied to C-flat 1/1 or 2/1 holey carbon grids 
(Science Services GmbH) glow-discharged for 
45 s at 0.15 mA. Grids were blotted for 4-5 s at 
blotforce 20 using a Vitrobot, and vitrified in 
liquid ethane. Electron micrographs were col- 
lected on a Titan Krios G2 with Falcon III de- 
tector in counting mode, at 300 kV and 75,000x 
magnification for a calibrated pixel size of 
1.053 A. 81-frame movies were automatically 
recorded with a dose of 0.4 e A? s"', using EPU 
software. Movies were aligned using MotionCor2 
within the Relion3-beta wrapper and CTF pa- 
rameters were calculated using CTFFind4.1.10. 
Particles were picked using Gautomatch with 
templates generated by 2D classification of a 
previous dataset (3), low-pass filtering images to 
20 A. The dataset was cleaned using 3D classi- 
fication in Relion3. Following a refinement of the 
full dimer with solvent masking, per-particle CTF 
parameters and beam tilt were refined, followed 
by Bayesian polishing and a second round of CTF 
and beam tilt refinement. The resulting 3D re- 
construction gave an overall resolution for the 
dimer of 2.94 A; masking of the stationary 
membrane-bound and lower peripheral stalk 
portions of the complex improved the resolution 
for this region to 2.69 A. Symmetry expansion 
of the dataset using relion_particle_symmetry_ 
expand, followed by cl refinement of the upper 
peripheral stalk gave a resolution of 2.75 A for 
this portion of the complex. 3D classification 
of the pre-aligned symmetry-expanded dataset, 
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Movie 5. Extensions of subunits a and f in 
Polytomella ATP synthase. The polypeptide 
sequences of subunits o and B in mitochondrial 
FiF, ATP synthases of chlorophycean algae 
(including Polytomella sp.) have characteristic 
extensions at their N and C termini. The 
~15-residue N-terminal extension of subunit a 
interacts with OSCP (see Fig. 4 and movie 4). 
The ~60-residue C-terminal extension of 

the B subunit wraps around the outside of the 
neighboring subunit a in the F; head. Both 
extensions would render the chlorophycean 
FF, complex more stable than that of mammalian 
or fungal mitochondrial ATP synthases. 


with T = 20 and no shifts, allowed separation of 
12 rotational states, with one class being low 
resolution and another being a mixture of states; 
further classification of the latter gave three ad- 
ditional rotary states for a total of 13 classes. Each 
was refined first with a mask around the selected 
monomer, and then with a mask around only the 
F, head and rotor portion of the complex. The 
classes were grouped according to the major ro- 
tary state they represent, and these were refined 
as for the substates. All steps described above 
were carried out in Relion3. The polished, CTF- 
refined dataset was imported into cryoSPARC 
(42) and refined using nonuniform refinement 
with automated masking; this map was used to 
assist manual building of the less ordered por- 
tions of ASA3 and ASAQ. For all rotary states and 
substates, composite maps were generated from 
the dimer-masked and F,+rotor-masked refined 
maps using phenix.combine_focused_maps (43). 


Map and model analysis 


Models of subunits a, OSCP, ¢, and ASA1 to ASA10 
were built manually in Coot (44). Models of a, 
B, y, 6, and c¢ were refined manually in Coot 
based on homology models calculated using the 
ModWeb server (45) on templates 5DN6 (46), 
2V7Q (47), 3ZIA (48), and 2XND (49), respec- 
tively. Models were refined by real-space re- 
finement with simulated annealing in Phenix (50) 
and checked manually before deposition. Model 
quality statistics are as given by the program 
phenix.validation_cryoem and are summarized 
in table S3. For comparison of rotary states and 
substates, maps were aligned to a map of F, and 
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the lower peripheral stalk (EMD-4806). Models 
of rotary states and substates were aligned using 
the Matchmaker tool of USCF Chimera (57), 
fitting to the a subunit. Figures were made 
using UCSF Chimera and ChimeraX (52). 


Polytomella sp. genomic sequence 


Genomic DNA was purified following a pub- 
lished protocol (53) with modifications. Briefly, 
Polytomella sp. cells were harvested in their 
logarithmic phase and resuspended in solubili- 
zation buffer containing 100 mM Tris-HCl, pH 
8.0, 40 mM EDTA, 100 mM NaCl, 2% (w/v) lauryl 
sarcosine, 1% (w/v) SDS, RNase A (20 pg/ml), and 
protein kinase K (0.9 mg/ml). After 1 hour at 4°C, 
DNA was extracted twice with phenol:chloroform: 
isoamy] alcohol (25:24:1) and the aqueous phase 
was ethanol-precipitated for 1 hour on ice. 
Genomic DNA was further purified by 4 M LiCl] 
and 7.5 M ammonium acetate before being 
ethanol-precipitated again and finally washed 
twice with water. Genomic sequencing was car- 
ried out by Microsynth AG with 50-fold sequence 
coverage. The genome size was ~50 Mb. A six- 
frame translation of the sequence was used as a 
query database for mass spectrometry results. 


Mass spectrometry 


Tryptic digestion of proteins in solution or ex- 
cised from 2D gels was performed according to 
published procedures (54). Proteolytic digests 
were loaded by nano-HPLC (Dionex RSLCnano) 
on reverse-phase columns (trapping column: Ac- 
claim PepMap c18, particle size 2 um, L = 20 mm; 
analytical column: Acclaim PepMap c18, particle 
size 2 um, L = 50 cm; Thermo Fisher Scientific) 
and eluted in organic phase gradients (Buffer A: 
95% HO, 5% DMSO, 0.1% formic acid; Buffer B: 
80% acetonitrile, 15% HO, 0.1% formic acid). 
Typically, gradients were ramped from 4% to 
48% buffer B in 80 min at a flow rate of 300 nl/min. 
Peptides eluting from the column were ionized 
online using a Nanospray Flex Ion source and 
analyzed in an Orbitrap Elite or Q Exactive Plus 
mass spectrometer. Mass spectra were acquired 
over the 350 to 1600 m/z range at a resolution 
of 120,000, and sequence information was ac- 
quired by computer-controlled, data-dependent 
automated switching to MS/MS mode in response 
to collision energies based on mass and charge 
state of the candidate ions. 

Datasets were processed with the Proteome 
Discoverer software package (version 2.1.0.81). 
Spectra were internally recalibrated on auto- 
proteolytic trypsin fragments when applicable. 
Proteins were identified by matching the derived 
mass lists against a customized Polytomella data- 
base (combination of NCBI nr Polytomeila, a full 
six-frame translation of a dataset acquired by 
whole-genome shotgun sequencing (Microsynth 
AG) and a list of common contaminants with the 
program Sequest (Thermo Fisher Scientific). In 
general, a mass tolerance of 10 ppm for parent 
ion spectra and 0.6 Da for fragment ion spectra, 
two missed cleavages, oxidation of Met (dynamic 
modification), acetylation of the protein N terminus 
(dynamic modification) and carbamidomethyl- 
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cysteine (fixed modification) were selected as 
matching parameters in the search program. 
Results were evaluated using a percolator node 
(high-confidence q value, FDR < 0.01) to exclude 
false positives. Proteome data have been uploaded 
to the PRIDE online repository (55). 


Protein sequence analysis 


Homologs of the ASA10 sequence were found by 
BLAST searches of the NCBI (56) and Phytozome 
(57) databases. Sequence alignments were car- 
ried out using Clustal Omega (58) and formatted 
using JalView (59). 
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The glycan CA19-9 promotes 
pancreatitis and pancreatic 


cancer in mice 
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Glycosylation alterations are indicative of tissue inflammation and neoplasia, but whether 
these alterations contribute to disease pathogenesis is largely unknown. To study the role of 
glycan changes in pancreatic disease, we inducibly expressed human fucosyltransferase 3 and 
61,3-galactosyltransferase 5 in mice, reconstituting the glycan sialyl-Lewis?, also known 

as carbohydrate antigen 19-9 (CA19-9). Notably, CA19-9 expression in mice resulted in rapid 
and severe pancreatitis with hyperactivation of epidermal growth factor receptor (EGFR) 
signaling. Mechanistically, CA19-9 modification of the matricellular protein fibulin-3 increased 
its interaction with EGFR, and blockade of fibulin-3, EGFR ligands, or CA19-9 prevented EGFR 
hyperactivation in organoids. CA19-9—mediated pancreatitis was reversible and could be 


suppressed with CA19-9 antibodies. CA19-9 also cooperated with the Kras 


12D oncogene to 


produce aggressive pancreatic cancer. These findings implicate CA19-9 in the etiology of 
pancreatitis and pancreatic cancer and nominate CA19-9 as a therapeutic target. 


ancreatitis, or inflammation of the pan- 

creas, is a painful, recurrent, and occasion- 

ally lethal medical disorder with limited 

treatment options. The incidence of acute 

and chronic pancreatitis is rising (7). Pan- 
creatitis accounts for more than 275,000 hospi- 
talizations in the United States per year, and the 
number of hospital admissions has increased 
by 20% over the past decade (2). The causes of 
pancreatitis include blockage of the pancreatic 
duct by gallstones, alcohol and certain drugs that 
cause acinar cell damage, medical procedures 
or trauma that damage pancreatic tissue, and 
autoimmune diseases (2). In approximately one- 
third of cases, the underlying etiology of the 
pancreatitis is unknown (idiopathic) (3, 4). Most 
acute pancreatitis cases will resolve with sup- 
portive care; however, up to 20% of patients will 
develop severe tissue damage and will either 


succumb to multiorgan system failure or suffer 
from bouts of recurrent disease with markedly 
diminished quality of life (2-4). Individuals with 
hereditary acute pancreatitis progress to chronic 
pancreatitis with a much higher penetrance and 
have a 40 to 55% lifetime risk of developing pan- 
creatic cancer (J, 5). Indeed, chronic pancreatitis 
promotes mutant Kras-mediated development of 
pancreatic cancer in mice (6). 

The glycan carbohydrate antigen 19-9 (CA19-9) 
is found in the serum of 10 to 30% of pancreatitis 
patients and in 75% of pancreatic cancer patients, 
as well as in patients with other gastrointestinal 
diseases (7-16). CA19-9 elevation is also detected 
in pancreatic intraepithelial neoplasms (PanINs), 
which are precursors to pancreatic ductal ade- 
nocarcinoma (PDAC) (17). CA19-9 [sialyl-Lewis* 
(sLe*)] is generated by the stepwise addition of 
sugar moieties to type I precursor chains present 


on proteins and other molecules, culminating in 
the o1,4 linkage of fucose to N-acetylglucosamine 
(fig. SIA). Fucosyltransferase 3 (FUT3) is the only 
enzyme with the ability to add fucose moieties 
through an a1,4 linkage and generate CA19-9. 
Mice lack this enzyme because Fut3 is a pseu- 
dogene in rodents (18, 19). 

To facilitate the discovery of PDAC biomarker 
candidates, we sought to create a mouse model of 
PDAC that recapitulated the elevation of CA19-9 
observed in human patients. This model would 
enable prioritization of biomarkers that out- 
perform CA19-9. Furthermore, changes to glyco- 
sylation often result in functional consequences. 
In this study, we investigated the role of CA19-9 
elevation in mouse and organoid models of pan- 
creatic disease. 


Recapitulation of CA19-9 elevation and 
regulation in cultured mouse PDAC cells 


To express CA19-9 in mouse cells, we transduced 
mouse PDAC cells with human FUTS. Expres- 
sion of FUT3 alone was insufficient for CA19-9 
production but did lead to increased levels of 
Lewis* antigens after removal of terminal galac- 
tose moieties present in rodents but not humans 
(Fig. 1A). The generation of the related Lewis* 
epitopes suggested that reprogramming of the 
precursor substrates would be necessary for the 
production of CA19-9 in pancreatic ductal cells. 
61,3-galactosyltransferase 5 (83GALT5) is required 
for the production of type I chain precursors (20), 
which serve as the precursors for the Lewis* 
modification (fig. SIA). Accordingly, expression 
of both FUT3 and B3GALT5 in mouse PDAC cells 
led to the cell surface expression of CA19-9 at 
levels equivalent to those observed in human 
cancer cell lines (COLO205, SUIT2) (Fig. 1B and 
fig. SIB). Comparable CA19-9 levels were observed 
in the blood of mice after orthotopic transplanta- 
tion of the CA19-9-expressing mouse and human 
cells (fig. S1C). 

To determine whether the murine PDAC cell 
proteins harboring CA19-9 modification are simi- 
lar to those in human PDAC cells, CA19-9 protein 
carriers were immunoprecipitated (IP) and iden- 
tified by mass spectrometry (MS) (fig. S2A and 
table S1). These analyses identified known CA19-9 
protein carriers in the FUT3- and B3GALT5- 
expressing mouse cells (n = 3; FC1199, FC1242, 
FC1245), including CD44, LGALS3BP, MUC1, and 
MUCS5AC (21-24). The human PDAC cell line, 
MiaPaCa-2, is CA19-9 negative, and therefore, 
we used it as a control to identify human CA19-9 
core proteins in the CA19-9-positive cell lines, 
Capan-2, SUIT2, and hM1-2D (fig. S2B and table 
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$2). We compared mouse and human CA19-9 
protein carriers and found that an average of 
72.3% (95% confidence interval 60.3 to 84.3%; 
n = 3) of the CA19-9-modified proteins identi- 
fied in all three human PDAC cell lines were 
also found in the engineered murine PDAC cell 
lines. Thus, expression of the human FUT3 
and B3GALT5 genes in mouse cells largely re- 
capitulates the human CA19-9 carrier profile 
(Fig. 1C). 


CA19-9 elevation in mice leads to acute 
and chronic pancreatitis 


We next generated a mouse model with induc- 
ible CA19-9 expression to study the effects of 
this glycan on pancreatic disease pathogenesis 
(fig. S2, C to E). Using PDX1-Cre (C), we restricted 
expression of the transgenes to the pancreas, 
duodenum, and bile duct. Cre-mediated excision 
of a LoxP-flanked stop (LSL) cassette enables 
ubiquitous expression of the reverse tetracycline- 
controlled transactivator 3 and mKate2 from the 
CAGS promoter within the ROSA26 locus (R'S") 
(25). Addition of doxycycline (Dox) to the diet ac- 
tivates the reverse tetracycline-controlled trans- 
activator, enabling FUT3, B3GALT5, and enhanced 
green fluorescent protein (eGFP) expression from 
the tetracycline response element promoter in the 
COLAI locus (F) (26). In addition, we excised the 
LSL cassette in the R allele to generate animals 
with Dox-inducible, whole-body CA19-9 expres- 
sion (R;F). Both C;R'S";F and R;F Dox-treated 
mice exhibited histologic signs of pancreatitis, 
including interstitial edema, lymphocyte infil- 
tration, and collagen deposition (Fig. 2A and fig. 
83, A and B). All other nonpancreatic tissues ap- 
peared histologically normal in both Dox-treated 
and untreated models and their littermate con- 
trols (figs. S4. and S5). The mice progressed from 
acute to chronic pancreatitis after 28 days of 
Dox treatment as demonstrated by the clinical 
histopathological hallmarks of acinar atrophy, 
accumulation of metaplastic ductal lesions, and 
persistent fibroinflammatory disease (figs. S3, A 
and B, and S6, A and B) (27). Pancreatitis was 
highly penetrant in both models (Fig. 2B and fig. 
S6, C and D). 

Both mouse models demonstrated elevated 
serum levels of the pancreatic enzymes amylase 
and lipase within 24 hours of Dox treatment, but 
no changes were observed in untreated mice and 
littermate controls (Fig. 2C and fig. S6, C and D). 
Patients with acute pancreatitis have elevated 
serum levels of amylase and lipase, but in the 
chronic phase of pancreatitis, the levels of these 
enzymes either normalize or decrease even fur- 
ther. We found in our mouse models that the levels 
of amylase and lipase began to normalize after 
4 weeks of Dox treatment, but both C;R'S";F 
and R;F mice still exhibited histologic signs 
of chronic pancreatitis. Expression of eGFP and 
CA19-9 was detected in the pancreas after Dox 
treatment of C;R'!;F mice and R;F mice, but 
they were not detected in untreated or control 
littermates (Fig. 2A and figs. S3C, $7, and S8). In 
both mouse models, CA19-9 was predominantly 
expressed in intralobular, intercalated, and meta- 
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plastic pancreatic ducts as well as in islet cells 
(Fig. 2A and figs. S3C, S7, and S8). Both models 
exhibited elevated CA19-9 levels in the circulation 
(Fig. 2D and fig. S6, C and D). Secreted CA19-9 
was also observed coating eGFP-negative endo- 
thelial cells as well as fibroblasts (figs. S7B and 
S8). E-selectin is an endogenous receptor of 
CA19-9 expressed by endothelial cells (28, 29) 
and may explain the accumulation of CA19-9 
within the vasculature. Despite recombination 
in the acinar compartment, acinar cells were 
rarely observed to be CA19-9 positive. R;F mice 
exhibited Dox-dependent expression of CA19-9 
and eGFP in all tissues examined, whereas in 
C;R'‘":F mice, expression was limited to the 
pancreas, duodenum, and gall bladder (Fig. 2A 
and figs. S3, C to G, and $7 to S11). 

We next sought to determine the prevalence 
of CA19-9 elevation in human pancreatic dis- 
ease and to compare the CA19-9 tissue expres- 
sion pattern observed in human patients and 
the CA19-9 genetically engineered mouse models. 
We therefore evaluated CA19-9 levels and expres- 
sion patterns in patients, including specimens 
of pancreatic cancer and adjacent normal tissue 
(n = '72) as well as surgically resected chronic 
pancreatitis samples (7 = 44) (figs. S12 and S13). 
CA19-9 was expressed at low levels in normal 
homeostatic pancreatic ducts and at elevated 
levels in adjacent reactive ducts that consist of 
atypical, benign pancreatic ducts that occur 
even in the absence of substantial inflammation 


(fig. S12A), preinvasive carcinomas (PanIN-1A 
and PanIN-1B), and invasive PDAC specimens 
(fig. S12, B to D) (77). In chronic pancreatitis 
specimens, CA19-9 was expressed at high levels 
in the reactive and metaplastic ducts with more 
sporadic expression in the centroacinar and aci- 
nar compartments (fig. $13, A and B, and table 
$3). Elevated levels of CA19-9 were detected in 
the serum of 20% of the chronic pancreatitis 
patients we examined, whereas more than 93% 
of these patients exhibited local elevation of 
CA19-9 in the resected pancreatic specimens 
by immunohistochemistry (IHC) (n = 44) (fig. 
$13, C and D). Together, these data indicate that 
CA19-9 elevation is a common feature of chronic 
pancreatitis and that the CA19-9 expression pat- 
tern is similar between the CA19-9 mouse models 
and human patients. 

Patients with severe acute or chronic pan- 
creatitis often also exhibit weight loss (30, 31). 
Therefore, we assessed the health and weight 
of both mouse models. Whereas most C;R'";F 
mice maintained their overall health status 
after Dox treatment, Dox-treated R;F mice ex- 
hibited significant body weight loss (fig. S14A). 
In severe cases, the weight loss exceeded 20%, 
and euthanasia was required. The weight loss was 
not due to pathology of the stomach, duodenum, 
or colon and was not associated with an auto- 
immune reaction, induction of endoplasmic re- 
ticulum stress in the pancreas, or loss of glycemic 
control (32) (fig. S14, B to D). Pancreatic exocrine 
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Fig. 1. FUT3 with B3GALT5 expression enables CA19-9 production in engineered mouse pan- 
creatic cancer cells. (A) Ectopic FUT3 induces Lewis* (Le*) but not CA19-9-sLe? expression in mouse 
PDAC cells by flow cytometric analysis. The Le: and CA19-9—sLe*—positive human cell line COLO205 
and Le*- and CA19-9-sLe*—negative parental KPC cell lines are shown. FITC, fluorescein isothio- 
cyanate; FSC-A, forward scatter—area; AF568, Alexafluor 568. (B) CA19-9 flow cytometry of mouse 
PDAC cells stably and constitutively expressing FUT3 with B3GALT5 (FB) compared with the isotype 
control antibody. (C) Overlap between CA19-9 protein carriers identified in three out of three human 
PDAC cell lines (n = 926) with three independent mouse PDAC cell lines expressing FUT3 and B3GALT5. 
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insufficiency is often observed in cases of chronic 
pancreatitis and can cause severe weight loss 
and destruction of the acinar compartment in 
patients (37). R;F mice exhibited a significant 
Dox-dependent decrease in pancreatic mass, 
whereas no change was observed in C;R'!;F 
mice and littermate controls after Dox treatment 
(fig. S14E). In addition, we observed a signifi- 
cant and sustained increase in steatorrhea in 
R;F mice, whereas C;R'";F mice exhibited a 
transient increase in fecal triglycerides that 
normalized within 4 weeks of Dox treatment 
(fig. S14F). The pancreatic atrophy and steator- 
rhea in R;F mice was accompanied by a sig- 
nificant reduction of fecal elastase in R;F mice 
within the first week of Dox treatment (fig. S14G). 
Together, these data indicate that R;F mice suffer 
from pancreatic atrophy and exocrine insuffi- 
ciency. Both the increased penetrance of CA19- 
9 expression in the Cre-independent R;F model 


28 days of Dox 7 days of Dox 3 days of Dox 0 days of Dox = 


and expression of CA19-9 in other tissues may 
contribute to the severe pancreatitis observed 
in this model. 

Human pancreatitis is associated with an in- 
flux of macrophages to the pancreas during the 
acute phase of the disease; after progression to a 
chronic condition, T and B cells also infiltrate 
the pancreas (33, 34). To determine whether 
this aspect of human pancreatitis is also found 
in mouse models of pancreatitis, we examined 
the immune infiltrate in C;R'*";F mice and the 
established cerulein model of acute pancreatitis. 
Flow cytometry revealed an influx of immune 
cells into the pancreas, including recruitment 
of inflammatory monocytes and macrophages, 
after induction of pancreatitis (Fig. 2E and figs. 
$15 and S16A). This result was confirmed by IHC 
(fig. S16B). Neither model showed significant re- 
cruitment of T or B cells at the acute pancreatitis 
time points examined by flow cytometry (1 to 
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3 days) (fig. S15). At the chronic phase of pan- 
creatitis, both R;F and C;R'S’;F models contained 
intrapancreatic T and B cells as indicated by IHC 
(fig. S17). Therefore, both the CA19-9 and cerulein 
mouse models of pancreatitis exhibited immune 
infiltrates similar to those found in patients with 
pancreatitis. 

Pancreatic cell proliferation is a common fea- 
ture of human pancreatitis (35). Cerulein-treated 
mice exhibited an elevation in Ki67-positive nuclei 
in the infiltrating immune cells as well as in the 
ductal and acinar compartments. Similarly, both 
R;F and C;R““";F mice exhibited increased Ki67- 
positive nuclei in the immune, acinar, and ductal 
compartments in the acute phases of pancrea- 
titis, which gradually diminished as the mice 
progressed to chronic pancreatitis (fig. $18, A 
to E). Increased ethynyl deoxyuridine (EdU) in- 
corporation was also observed in the pancreas, 
including elevation in the immune, ductal, and 
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Fig. 2. CA19-9 expression promotes pancreatitis in mice. (A) Histologic 
evaluation of C;R'S“:F mice by hematoxylin and eosin (HE) staining, Masson's 
trichrome staining (MT) (blue indicates collagen deposition), and CA19-9 
expression (open arrow, CA19-9* duct; closed arrow, CA19-9* islet) by IHC 
after treatment with Dox. Scale bars = 50 um. (B) Quantification of the 
percentage of pancreatic area exhibiting histologic signs of pancreatitis after 
treatment of C:R'S“-F mice with Dox. (C) Circulating levels (U/L) of the 
pancreatic enzymes amylase and lipase in C:R'S“-F mice after treatment with 
Dox (days). The dotted line indicates the threshold elevation required for 
the diagnosis of pancreatitis. The dashed line indicates the maximum level of 
detection possible for amylase. (D) The circulating level of CA19-9 (U/ml) 
after treatment of C;R'S":F mice with Dox. Values that exceed 37 U/ml (dotted 
line) are elevated. (E) Immune cell infiltration evaluated by flow cytometry 
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in mice treated with phosphate-buffered saline (PBS) or cerulein (Ceru) for 
2 days followed by a 1- or 3-day recovery period (C57BI/6j; n = 5, 5, and 

5, respectively) and C:R'S“:F mice treated with Dox (n = 4, 4, and 6, 
respectively) compared with “genetically negative” controls (GN) (n = 3 

and 3, respectively). Outlier analysis using Grubb's method identified one data 
point (triangle symbol, GN, 3 days of Dox) as an outlier. (F) EdU incorporation 
in the pancreas was evaluated by flow cytometry in C:R'S“F mice (n = 3, 3, 
and 4, respectively) and littermate GNs (n = 3 and 2, respectively) after 
treatment with Dox. Middle horizontal red lines represent the mean, and error 
bars represent the standard deviation; each data point represents a 
measurement from an individual mouse. *P < 0.05, **P < 0.01, ***P < 0.001, 
***P < 0.0001 for multiple comparisons using Holm-Sidak’s procedure 
following a one-way analysis of variance. 
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fibroblast compartments (Fig. 2F and fig. S18F). 
Together, these data demonstrate that CA19-9- 
mediated pancreatitis in mice bears similarity 
to the human disease. 


CA19-9 expression results in 
hyperactivation of epidermal growth 
factor receptor signaling 


To identify the molecular mechanisms underly- 
ing the pancreatitis phenotype observed in mice, 
we focused our studies on the C;R'";F model. 
Other researchers have established that epider- 
mal growth factor receptor (EGFR) signaling is 
necessary and sufficient for the induction of 
pancreatitis and is important for the initiation of 
pancreatic cancer in mice (36, 37). Accordingly, 
we found that the levels of tyrosine phosphoryl- 
ated and activated EGFR were prominently ele- 
vated in the pancreatic ducts of C;RS“.F mice 
after Dox treatment (Fig. 3A). To identify the al- 
terations in cell signaling pathways that occur in 
direct response to CA19-9 expression, we isolated 
pancreatic ductal organoids from C;R'*";F mice 
(38). RNA-sequencing analyses of the C;R".F 
organoids after induction of CA19-9 expression 
revealed the expected changes in transgene ex- 
pression (fig. S19, A to F, and table S4). FUT3 
and B3GALT5 expression levels were compara- 
ble between mouse and human ductal organo- 
ids (fig. S19D) (39). 

Gene set enrichment analysis identified sig- 
nificant changes to 21 pathways in the hallmarks 
of cancer gene sets and 64 pathways annotated 
in the KEGG (Kyoto Encyclopedia of Genes and 
Genomes) pathway gene sets (Fig. 3B and table 
$4). CA19-9 expression triggered a reduction in 
expression of genes associated with the unfolded 
protein response and an increase in expression 
of genes associated with chemokine, hedgehog, 
Janus kinase-signal transducers and activators of 
transcription, and transforming growth factor-B 
signaling. The latter genes may participate in 
the activation of the fibroblast compartment 
and recruitment of the immune infiltrate (e.g., 
interleukin-la, colony-stimulating factor 1, tumor 
necrosis factor). We further explored enrichment 
of the extracellular matrix (ECM)-receptor in- 
teraction, ERBB, phosphatidylinositol 3-kinase 
(PI3K), and AKT signaling pathways (Fig. 3B). 
CAI19-9 expression in C;R™";F organoids resulted 
in elevated EGFR phosphorylation (Y1068 and 
Y1148) and decreased total EGFR, indicating 
greater flux through this pathway (40) (Fig. 3C 
and fig. S20, A, B, and E to G). The increase in 
phosphorylated EGFR was accompanied by 
increased phosphorylation of focal adhesion 
kinase (FAK) (Y397), AKT, and extracellular 
signal-regulated kinase 1 and 2 (ERK1/2) (Fig. 
3C and fig. S20F). Coincident FAK and EGFR 
activation through integrin-mediated interaction 
with the ECM has been previously reported in 
three-dimensional but not monolayer culture 
models (4/, 42). EGFR stimulation by EGF was 
more rapid and robust in organoids that were 
previously cultured in EGF-free conditions, but 
EGF stimulation did not induce FAK phospho- 
rylation (fig. S20, C and D). Although the al- 
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Fig. 3. CA19-9 expression activates EGFR signaling. (A) Representative phosphorylated EGFR IHC 
in C;R'S“:F mice that were treated for O (n = 4) or 3 (n = 5) days with Dox. Insets are higher magnification 
of pancreatic ducts. Scale bars = 50 um. (B) Gene Set Enrichment Analysis of C;R'S“:F organoids 
identified enrichment in PISK/AKT/mTOR and ECM-receptor signaling and downstream effector path- 
ways. FDR, false discovery rate; NES, normalized enrichment score. (C) C:R'S“F organoids (n = 3 biological 
replicates) and GN organoids (n = 2 biological replicates) were evaluated by immunoblot for the activation 
of signaling pathways after treatment with Dox. Blots are representative of three technical replicates, 
three biological replicates of C:R'S“F organoids, and two biological replicates of GN organoids. p, 
phosphorylated; t, total. (D) Quantification of changes to the ratio of phosphorylated to total protein is 
shown for Fig. 3, E to G. (E and F) Organoids from GN littermates were incubated with conditioned 
media (CM) from C;R'S“:F organoids with or without previous Dox treatment (24 hours) in the presence 
or absence of Fe control or EGFR-conjugated Fe and evaluated by immunoblot. (G) C;R'S“:F organoids 
(n = 3 biological replicates) were treated with Dox (hours) in the presence of isotype control 
(MOPC-21) or CA19-9-blocking monoclonal antibody (NS19-9) and evaluated by immunoblot. 
Blots are representative of two technical and three biological replicates. 


terations of phosphorylated EGFR, total EGFR, 
and their ratio were consistent among biological 
replicates, no consistent changes to total or phos- 
phorylated S6, HER2, and p65 were observed 
(fig. S20, A and B). These findings were inde- 
pendent of the inclusion of murine EGF in the 
media of the cultures (fig. S20E), and the level 
of phospho- and total EGFR and downstream 
effector induction exceeded the increase ob- 
served after treatment with additional EGF 
(fig. S20, B to D). 

The glycosylation state of EGFR has been 
reported to affect its ability to activate and to 
respond to targeted kinase inhibitors (43, 44). 
Accordingly, we examined whether EGFR was 
modified by CA19-9 in mouse organoids, but 
this was not detected (fig. S20H). To investigate 
whether the CA19-9-dependent activation of 


EGFR was due to a soluble ligand, we evaluated 
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conditioned medium from CA19-9-expressing 
organoids and found that it stimulated EGFR 
phosphorylation in control murine ductal organ- 
oids (Fig. 3, D to F). This activity was attenuated 
by the addition of EGFR-Fc (EGF trap), further 
supporting the presence of one or more EGFR 
ligands (Fig. 3, D and F). Addition of a mono- 
clonal antibody to CA19-9 (NS19-9) was also 
sufficient to block EGFR phosphorylation in 
C;R'S*F organoids (Fig. 3, D and G). These 
findings suggested the presence of a CA19-9- 
modified, secreted EGFR ligand. 

EGFR complexes from C;R‘*";F organoid 
lysates were identified by IP-MS (Fig. 4A and 
table S5). We observed elevated levels of endog- 
enous fibulin-3 [Efemp1 (FBLN3)] by IP-MS in 
C;R'S";F, but not control organoids, after Dox 
treatment (Fig. 4A). FBLN3, a secreted matri- 
cellular glycoprotein with five EGF-like domains, 
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has been proposed to be a ligand for EGFR (45). 
Furthermore, we performed CA19-9 IP-MS from 
the conditioned media using two different CA19-9 
monoclonal antibody clones and identified FBLN3 
as a secreted, CA19-9-modified protein (Fig. 4B, 
fig. S21A, and table S6). 

To confirm the glycosylation status of FBLN3, 
we coexpressed FLAG epitope-tagged FBLN3 in 
the presence or absence of FUT3 and B3GALT5 


in mouse PDAC cells and detected CA19-9 modi- 
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fication of secreted FBLNS (fig. S21B). The mRNA 
expression of fibulin family members and EGFR 
ligands did not change after Dox treatment of 
C;RS|.F organoids (fig. S21C). Furthermore, the 
total protein levels of FBLN3 were unchanged 
after CA19-9 expression in organoid-conditioned 
media and in mouse plasma in vivo (fig. S21D). 
Secreted FLAG-FBLN3 isolated by immunopre- 
cipitation coprecipitated with EGFR in PDAC 
cell lysates, consistent with an association of 
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Fig. 4. CA19-9-modified fibulin-3 activates EGFR. (A) IP-MS of EGFR complexes from C:R'S“F 

(n = 3) and GN organoid (n = 2) whole-cell lysates after Dox administration (8 hours). (B) CA19-9 protein 
carrier IP-MS analysis of fibulin-3 from conditioned media from C:R'S“F organoids treated with Dox 
for O to 8 hours (n = 3) relative to untreated and treated GN organoids (n = 1) using 5B1 CA19-9 antibody 
clone. (C) Immunoblot of IP fibulin-3 (FLAG) after incubation with lysates from pancreatic cancer 

cells lacking FLAG-FBLN3 expression. ISO, isotype control antibody. (D) C;R'S“:F organoids (n = 3) 
treated with Dox in the presence of MOPC-21 or three independent fibulin-3 antibodies (FBLN3 Ab) and 
evaluated by immunoblot. Changes to the ratio of phosphorylated to total protein levels are included. 
mAb, monoclonal antibody; pAb, polyclonal antibody. (E) Organoids transduced with hairpins to GFP or 
to FBLN3 were immunoblotted after Dox treatment. Changes to the ratio of phosphorylated to total 
protein levels are included. Middle horizontal red lines represent the mean, and error bars represent the 
standard deviation. *P < 0.05 by unpaired, two-tailed t test. 
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FBLN3 with EGFR (Fig. 4C). Because the asso- 
ciation of ectopic FLAG-FBLN@ with EGFR in cell 
lysates occurs irrespective of its CA19-9 glycosyl- 
ation status, the finding that the glycosylation 
status of endogenous FBLN3 increases associ- 
ation with EGFR may reflect the interaction of 
sLe*-modified FBLN3 with additional extracel- 
lular proteins. To determine whether FBLN3 is 
necessary for the activation of the EGFR pathway 
after CA19-9 expression, multiple FBLN3-blocking 
antibodies and short-hairpin RNA (shRNA) vectors 
were utilized and shown to prevent EGFR phos- 
phorylation and the activation of downstream 
effector pathways in C;R'*";F organoids (Fig. 4, 
D and E£, and fig. $21, E and F). 


CA19-9 as a therapeutic target 
in pancreatitis 


We designed a Dox pulse-chase approach to 
determine if CA19-9 expression is required to 
maintain pancreatitis. CA19-9-mediated pancre- 
atitis was completely reversed in C;R'“";F mice 
after a 3-day Dox pulse and 4-day recovery period 
(fig. S22, A to C). Partial resolution of chronic 
pancreatitis in the C;R'S":F model (28-day Dox 
treatment) was also observed after a 14-day re- 
covery period (fig. S22, D and E). In a preventive 
setting of acute pancreatitis, two antibodies di- 
rected against CA19-9 both reduced immune in- 
filtration, ductal metaplasia, and fibrosis in vivo 
(Fig. 5A and fig. S22, F and G). In addition, 
decreased release of amylase and lipase into 
the circulation and reduced hyperactivation of 
EGFR were observed (Fig. 5, B and C, and fig. 
$22, H and I). In an intervention setting of 
existing acute pancreatitis, we found that two 
forms of 5B1 significantly reduced the secretion 
of amylase into the circulation with modest 
normalization of the pancreatic histology (fig. 
$22, J to L). CA19-9 antibody treatment of exist- 
ing acute pancreatitis also reduced the levels of 
phosphorylated EGFR in both the ductal and 
acinar compartments and decreased recruit- 
ment of macrophages (fig. S22, M and N). These 
data suggest that CA19-9 plays a role in disease 
pathogenesis and maintenance and that CA19-9- 
targeted therapy may warrant further therapeu- 
tic exploration. 

Inhibition of EGFR in vivo by using erlotinib 
was not as effective as CA19-9 antibody blockade 
to mitigate pancreatitis induction in mice (fig. 
$23, A to F). Indeed, erlotinib treatment of CA19- 
9-expressing mice caused severe weight loss, 
necessitating euthanasia. These effects were 
unrelated to erlotinib toxicity in control mice. 
Although serum levels of amylase and lipase 
decreased in erlotinib-treated animals, pancre- 
atic atrophy and acinar cell vacuolization were 
observed, suggesting that the weight loss were 
due to increased pancreatitis severity and result- 
ing exocrine insufficiency (fig. S23, B and C). 
Erlotinib treatment also incompletely blocked 
phospho-EGFR levels in vivo (fig. S22D). Acinar- 
ductal metaplasia (ADM) can be detected by SOX9 
IHC (36) and occurred after 7 days of Dox treat- 
ment in the C;R“";F model (fig. S22E). Although 
ADM was less apparent in the erlotinib-treated 
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C;R“";F mice (fig. S22F), lymphocyte infiltra- 
tion and fibrosis remained unaffected. CA19-9 
sequestration is unlikely to interfere with ADM 
survival mechanisms in the acinar compart- 
ment, given that these cells are largely CA19-9 
negative (46). Therefore, EGFR kinase inhibi- 
tion alone cannot substitute for CA19-9 blockade 
and may be a harmful therapy in the setting of 
pancreatitis. 


5B1 B 


Pancreatic tumorigenesis is accelerated 
by CA19-9-—mediated pancreatitis 

To determine whether CA19-9 expression pro- 
moted PDAC, we intercrossed the CREF alleles 
with the conditional Kras'““”? allele (K;C;R'“";F) 
(fig. S24A) (47, 48). CA19-9 expression signifi- 
cantly accelerated pancreatic cancer lethality 
relative to untreated mice and control littermates 
(Fig. 6A). When treated with Dox, K;C;R'";F 


Amylase 


mice rapidly succumbed to primary and meta- 
static pancreatic cancer with a median survival 
of 202 days relative to 460 days in the K;C 
control cohort and 420 days in the untreated 
K;C;R"™!;F cohort. The primary tumors were 
anaplastic with glandular features (Fig. 6B). 
Widespread metastases were observed in the 
peritoneum, diaphragm, liver, and lung in mul- 
tiple Dox-treated K;C;R'";F mice. To better 


Fig. 5. CA19-9 as a therapeutic target for the 


treatment of pancreatitis. (A) HE staining of 


representative areas from C;R'S“F mice after 


treatment with human isotype control (hlgG1; 
n=5)or the CA19-9 antibody clone 5B1 (n = 5) for 
8 days and Dox for the last 7 days. Scale bars = 
100 um. (B) Serum amylase and lipase levels 
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* 


in C;R'S“-F mice treated with 5B1 or isotype 
control. (©) Phospho-EGFR IHC on representative 
mice treated with either isotype or 5B1 as 
described in Fig. 5A. Scale bars = 50 um. Middle 
horizontal red lines represent the mean, and 
error bars represent the standard deviation; each 
data point represents a measurement from an 
individual mouse. P value was determined by 
using an unpaired, parametric, two-tailed t test. 
*P < 0.05, **P < 0.01. 
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Fig. 6. CA19-9 promotes rapid 
and aggressive pancreatic 
tumorigenesis. (A) Survival curve 
for untreated (eight deaths out 

of 19 mice) and Dox-treated 
K;C:R'S“F mice (14 deaths out of 
19 mice) and K;C GNs (13 deaths 
out of 34 mice). The P value was 
determined by a log-rank Mantel-Cox 
test. (B) Representative histology 

of pancreatic tumors and metastatic 
lesions from Dox-treated K;C;R'S“:F 
mice. Scale bars = 200 um. 

(C) Representative histology 
(scale bars = 100 um), (D) CA19-9 
IHC (scale bars = 100 wm), and 
(E) pEGFR IHC (scale bars = 

50 um) of the pancreata from 

K:C and K;C;R'S“:F mice after 2 to 
4 weeks of Dox treatment. 
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understand the role of CA19-9 in PDAC initia- 
tion, we examined the effect of short-term CA19-9 
expression on pancreatic transformation. Whereas 
littermate controls exhibited the expected low 
burden of mPanIN-1A lesions, CA19-9-expressing 
animals harbored a high penetrance of cystic 
and fibroinflammatory disease with abundant 
mPanIN-1B and occasional mPanIN-2 lesions 
after 2 weeks of Dox (Fig. 6C) and increased mac- 
rophage infiltration relative to the K;C control 
cohort (fig. S24, B and C). After 4 weeks of Dox, 
cystic papillary neoplasia and invasive carcinoma 
could be detected (Fig. 6C). CA19-9 expression 
was elevated in normal, benign-reactive, and 
metaplastic ducts as well as in mPanIN and 
PDAC lesions in K;C;R“";F mice, similar to the 
human expression pattern (Fig. 6D and fig. S12). 
Equivalent levels of CA19-9 were also observed in 
human PDAC and mouse K;C;R“";F organoids 
(fig. S24D). Phosphorylated EGFR (Fig. 6E) was 
detected at high levels in Dox-treated K;C;R'S":F 
mice, whereas CA19-9-negative K;C mice exhib- 
ited low to negative levels of phospho-EGFR. 


Discussion 


CA19-9 is expressed at low levels in the normal 
ducts of the pancreas but becomes elevated in 
benign-reactive, metaplastic, and malignant ducts 
in humans. It is conceivable that the degree of 
CA19-9 elevation could impart a way to control 
the degree of fibroinflammatory response in both 
pancreatitis and PDAC. Mice retain the ability to 
respond to elevations in CA19-9 despite the fact 
that they do not normally express this glycan. 
The ability of mice to respond to CA19-9 eleva- 
tion may be due to the existence of similar regu- 
latory mechanisms for related Lewis antigens 
(e.g., Sialyl-Lewis*) that are involved in similar 
processes in other organs and have also been 
shown to substitute for CA19-9 in individuals 
lacking this glycan (49). 

Patients with pancreatitis have an elevated 
risk (2.7- to 16.5-fold) of developing PDAC, and 
individuals with hereditary pancreatitis have a 
40 to 55% lifetime risk of developing pancreatic 
cancer (1, 5). Therapeutic options for pancreatitis 
patients are now focused on treating the symp- 
toms, and little can be done to facilitate the reso- 
lution of idiopathic pancreatitis or to prevent its 
recurrence, highlighting the need for new treat- 
ments. Prophylactic intervention could also be 
beneficial in the setting of recurrent or hereditary 
pancreatitis and after certain routine procedures 
for which pancreatitis is a common outcome. For 
example, 3.5% of the >700,000 patients under- 
going endoscopic retrograde cholangiopancrea- 
tography each year in the United States will 
develop pancreatitis (50, 51); several risk fac- 
tors can be used to identify patients at elevated 
risk for endoscopic retrograde cholangiopan- 
creatography-associated pancreatitis (40%) (52). 
Fully human CA19-9 antibodies have passed 
phase 1A clinical trials for positron emission 
tomography imaging of pancreatic cancer (53). 
This could potentially facilitate rapid translation 
of CA19-9-targeted therapy to the clinic for the 
treatment of pancreatitis. 


Engle et al., Science 364, 1156-1162 (2019) 
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Quantum amplification of mechanical 


oscillator motion 


S. C. Burd»?*, R. Srinivas”, J. J. Bollinger’, A. C. Wilson’, D. J. Wineland»””*, 
D. Leibfried’, D. H. Slichter’, D. T. C. Alleock””? 


Detection of the weakest forces in nature is aided by increasingly sensitive 
measurements of the motion of mechanical oscillators. However, the attainable 
knowledge of an oscillator’s motion is limited by quantum fluctuations that exist even 

if the oscillator is in its lowest possible energy state. We demonstrate a technique 

for amplifying coherent displacements of a mechanical oscillator with initial magnitudes 
well below these zero-point fluctuations. When applying two orthogonal squeezing 
interactions, one before and one after a small displacement, the displacement is 
amplified, ideally with no added quantum noise. We implemented this protocol with 

a trapped-ion mechanical oscillator and determined an increase by a factor of 

up to 7.3 (£0.3) in sensitivity to small displacements. 


echanical oscillators are essential com- 
ponents in an increasing variety of pre- 
cision sensing applications, including 
gravitational wave detection (J), atomic 
force microscopy (2), cavity optome- 
chanics (3), and measurement of weak electric 
fields (4). Quantum mechanically, any harmonic 
oscillator can be described by a pair of noncom- 
muting observables; for a mechanical oscillator, 
these are typically position and momentum. The 
precision of measurement of these observables 
is limited by unavoidable quantum fluctuations 
that are present even if the oscillator is in its 
ground state. Using the method of “squeezing,” 
these zero-point fluctuations can be manipu- 
lated while preserving their product as dictated 
by the Heisenberg uncertainty relation. This 
squeezing allows for improved measurement 
precision for one observable at the expense of 
increased fluctuations in the other (5). 
Although squeezed states have been created in 
a variety of physical systems, including electro- 
magnetic fields (6), spin systems (7), micromechan- 
ical oscillators (8-10), and the motional modes 
of single trapped ions (J/, 12), exploiting squeezing 
for enhanced metrology has been challenging. 
In particular, noise added during the detection 
process will limit the metrological enhancement 
unless it is smaller than the squeezed noise. The 
requirement of low-noise detection can be over- 
come by increasing the magnitude of the signal 
to be measured. In optical interferometry (13) 
and in spin systems (74), it has been shown that 
reversal of squeezing interactions can magnify 
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small phase shifts, thereby relaxing detection 
requirements (15). Photon field displacements 
in microwave cavities have also been amplified 
using similar phase-sensitive amplification schemes 
(16, 17). However, in mechanical oscillator sys- 
tems, technical challenges in implementing re- 
versible squeezing interactions have prevented 
prior use of such methods. 

We present a protocol, based on reversible 
squeezing, for ideally noiseless phase-sensitive 
amplification (5) of mechanical oscillator dis- 
placements. This amplification method (Fig. 1) 
is applicable to any harmonic oscillator where 
reversible squeezing can be implemented faster 
than system decoherence. By first squeezing the 
motional ground state, quantum fluctuations 
along a particular phase space quadrature are 
suppressed. A small initial displacement o; (to 
be amplified) is then applied along the squeezed 
axis. At this stage, although the signal-to-noise 
ratio (SNR) for measuring o; has been improved 
by squeezing, resolution below the zero-point 
fluctuations would require a detection method 
with yet lower noise. Finally, by reversing the 
squeezing interaction, the oscillator returns to 


Im(q) 


S = 6B@) 


a minimum-uncertainty coherent state with a 
larger amplitude of = Go, where G is the gain. 
Ideally, this process adds no noise in either quad- 
rature. For an oscillator described using crea- 
tion and annihilation operators a’ and a, the 
amplification is given by the identity 


D(as) = S"(&)D(ai)S (6) (1) 


(18), where D(a) = exp(aa" — a*G) is the displace- 
ment operator, and $(&) = exp[(E*a? — a") /2] 
is the squeezing operator with complex squeezing 
parameter &(7, 6) = 7 exp(0). For arbitrary orienta- 
tions of the displacement o,; with respect to the initial 
squeezing axis, of = 0; cosh(r) + o;* exp(z8) sinh(r). 
Maximum amplification is achieved if the displace- 
ment is along the squeezed axis where arg(a;) = 
0/2, giving G = exp(r). 

We demonstrate this technique using a single 
trapped Mg" ion as the mechanical oscillator 
(19). The ion is held ~30 um above a linear surface- 
electrode radio-frequency trap (20, 21), which 
is cryogenically cooled to 18 K. Experiments are 
performed on a radial motional mode of the ion 
with frequency @, ~ 2m x 6.3 MHz, energy eigen- 
states denoted by |7), and zero-point wave func- 
tion extent of ~5.7 nm (19). To analyze the motional 
state, we use qubit states ||) =|F = 3, mp =1) and 
|t) = |F = 2, mp = 1) within the So electronic 
ground-state hyperfine manifold, where F is the 
total angular momentum and mr is its projection 
along the direction of the quantization magnetic 
field of approximately 21.3 mT. The qubit transi- 
tion frequency @ = 2n x 1.686 GHz is first-order 
insensitive to magnetic field fluctuations, giving 
a qubit coherence time longer than 200 ms (27). 
The qubit state can be manipulated with reso- 
nant microwave carrier pulses. In each experi- 
ment, the ion is initialized in the electronic and 
motional ground state | |)|0) with optical pump- 
ing, resolved-sideband laser cooling (22), and 
microwave pulses. Qubit readout is accomplished 
by applying a laser resonant with the *S1/. <> "P3/o 
cycling transition and detecting state-dependent 
ion fluorescence. We analyze the motional state 
of the ion by applying sideband interactions to 
map it onto the qubit states (1/7, 12). Applying a 
sideband interaction for various durations re- 
sults in qubit Rabi oscillations with multiple fre- 
quency components whose amplitudes depend 
on the Fock state populations. We generate these 


> : ; > 
squeeze 


a 
/ ~ 
displace ; anti- y 


squeeze 


Fig. 1. Conceptual illustration of the amplification protocol. Each panel shows a Wigner 
function phase space distribution (not to scale) in a frame rotating at the oscillator frequency. 
A displacement a; of an initially squeezed ground state is amplified by subsequent reversed 
squeezing (“anti-squeezing”), resulting in a final coherent state with amplitude Ga; with 

no added noise. Dashed red circles indicate the characteristic extent of the initial 


ground-state fluctuations. 
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interactions using oscillating magnetic field grad- 
ients (27). The blue sideband (BSB) interaction 
induces transitions between the states ||) |) and 
|t)| + 1) with Rabi frequencies proportional to 
Vn -+ 1. The red sideband (RSB) interaction drives 
transitions between ||) |72) and |t)|7-1) with Rabi 
frequencies proportional to \/n and will not cause 
a qubit transition if the ion is in ||)|0). 

Squeezing of the motional state is accomplished 
by applying an oscillating potential at twice the 
motional frequency (2w,.) to the radio-frequency 
electrodes of the trap (23). This modulates the 
confining potential for the ion, yielding the in- 
teraction picture Hamiltonian 


T= ing [a? exp( ie) — a’? exp(i8)| (2) 
(19), where hf is the Planck constant divided by 
2n, gis the parametric coupling strength, and 0 is 
the phase of the parametric modulation. Apply- 
ing this Hamiltonian for duration ¢ implements 
the unitary squeezing operator S() with 7 = gt. 
Although electronic parametric modulation has 
been used with single ions to squeeze a thermal 
state of motion (24) and for phase-sensitive para- 
metric amplification of highly displaced thermal 
states (25), it has not previously been imple- 
mented on pure quantum states. Optical forces 
can also be used for parametric modulation (7), 
but decoherence due to photon scattering and 
higher-order nonlinearities in the optical field 
have limited the achievable squeezing (77). Squeezed 
mechanical oscillator states can also be prepared 
using dissipative reservoir engineering (8, 12). 
However, this is not a unitary squeezing oper- 
ation, as is required for the amplification meth- 
od described here. 

We characterize our squeezing process using 
motional sideband analysis to extract Fock state 
populations (79) (Fig. 2). To characterize the 
unitarity of our squeezing operations, we mea- 
sure the ground-state population after squeezing 


Fig. 2. Fock state population analysis. 

(A to C) Histograms of Fock state populations 
extracted by fitting to BSB Rabi oscillations. 
Vertical bars are derived by fitting to an 
unconstrained population distribution. Solid blue 
lines are fits assuming parameterized functional 
forms of the ideal Fock state populations, yielding 
values of r, oj, and of (19). Insets show Wigner 
function illustrations of the corresponding 
motional states. (A) Initial squeezed motional 
ground state with r = 2.26 (+0.02). (B) After 
displacing this state by o; = 0.200 (40.002). 

C) Final coherent state with amplitude of = 1.83 
+0.01), following the reversed squeezing 
operation. The initial displacement is amplified 
by G = a;/o; = 9.17 (£0.09). (D) Squeezing 
parameter r (black circles) as a function of the 
parametric drive duration. The solid line is a linear 
fit whose slope gives the parametric coupling 
strength g. (E) Measured gain (black circles) as a 
function of the squeezing parameter r. The solid 
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and anti-squeezing, (0|S*S|O). For r < 2, this pop- 
ulation is ~0.98, which is consistent with the mea- 
sured value without squeezing and anti-squeezing 
(r = 0). The population in n = 0 remains above 
0.93 for r < 2.37 (+0.03), or 20.6 (+0.3) dB of 
squeezing (19). The calibrated parametric coupling 
strength is g = 27 x 50.2 (+0.2) kHz, equivalent 
to a squeezing rate of 2.75 (+0.02) dB/us. 

We use this unitary squeezing interaction to 
demonstrate amplification of harmonic oscil- 
lator displacements (see Fig. 1). Displacements 
are implemented by applying an oscillating 
potential resonant with the motional mode (at 
frequency ,.) to an electrode of the ion trap 
(79). All control fields are digitally synthesized 
with the same reference clock, enabling stable 
and deterministic control of the relative phases 
between the displacement, squeezing, sideband, 
and carrier interactions. At each stage of the 
amplification process, we verify the Fock state 
composition of the ion’s motional state using 
sideband analysis (Fig. 2, A to C). The measured 
gain for various values of the squeezing param- 
eter closely follows the theoretically expected 
exponential growth of the coherent state am- 
plitude (Fig. 2E). 

Using this amplification technique, we achieve 
increased sensitivity when measuring displace- 
ments much smaller than the zero-point fluctua- 
tions. To map the final displacements o, onto the 
qubit states, we use a phase-sensitive red side- 
band (PSRSB) method (26) (Fig. 3A), which reaches 
the standard quantum limit for |o,| « 1 (19). 
Here, a displacement of the motional ground 
state results in a probability of measuring ||) of 
P, = Wf - C(log|) cos 6], where C(|a,|) is the 
signal contrast and 9 is the phase of a carrier 
m/2 pulse, which follows the RSB x mapping 
pulse. For |a,¢| « 1, C(ag) = 2|a¢|. In comparison, 
simply measuring the qubit directly after the 
RSB rx pulse gives a signal P, lo,|?. Without 
amplification, o, = 0; and the PSRSB contrast is 


0.2 


c 
oO 


0.0 


ine is the theoretical gain G = exp(r). Error bars denote SD. 


C(\o;|). With amplification, the initial displace- 
ment amplitude o; is ideally increased by a factor 
of G and the PSRSB contrast becomes C(|Goj|). 
This increase in contrast is shown in Fig. 3B, 
where the presence of oscillations for the state 
after amplification indicates that it has a well- 
defined motional phase. The carrier phase de- 
pendence in this figure is a feature of the PSRSB 
method, not of the amplification protocol. Figure 
38C highlights the phase-sensitive nature of the 
amplification protocol by plotting the contrast C 
of the PSRSB fringe against the squeezing phase 
8 for a fixed displacement. Maximum amplifica- 
tion is achieved when the displacement is oriented 
along the squeezed axis of the initial squeezed 
state in motional phase space (see Fig. 1). Figure 
3D shows the measured signal contrast as a 
function of |a;| for various parametric drive 
durations. For each displacement, the contrast 
is defined as C = Py max - P| min, Where P) max 
and P) min are the maximum and minimum, re- 
spectively, of the fringes shown in Fig. 3B. The 
uncertainty in measuring the contrast is o(C) = 
/6(P| max)? + 6(P| min)2, where o(P) maxtminy) is 
the variance of the projection noise associated 
with measuring P, maxminy- The SNR for a dis- 
placement measurement is then s(G) = C(Ga;,)/ 
o[C(Go;)]. For a given number of experiments, 
amplification allows the SNR for a displacement 
measurement to be improved in comparison to 
the ideal PSRSB measurement with no squeezing 
(Fig. 3D, black solid line), giving a measurement 
sensitivity enhancement of s(G)/s(G = 1). For 
measurements where C < 0.25, the contrast 
varies linearly with |o,|, and the gain in contrast 
C(Ga;)/C(a;) sets a lower bound (which becomes 
exact as |a;| > 0) on the measurement sensitivity 
enhancement, because the projection noise de- 
creases monotonically with increasing contrast. 
Increasing the squeezing results in increased con- 
trast for |a,| < 1, up to a squeezing time of 
approximately 8 us [corresponding to r = 2.54 
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Fig. 3. Measurement sensitivity enhancement. (A) Pulse sequence for displacement sensing 
protocol with PSRSB detection. (B) Population in |{) as a function of the carrier x/2 pulse 
phase. Blue inverted triangles, data with no squeezing; red circles, data with amplification. 
Solid lines show sinusoidal fits to the data. (C) Contrast of the carrier phase scan, as shown 

in (B), as a function of the squeezing phase 8 for a fixed displacement. (D) Contrast as a function 
of the displacement amplitude |aj| for different initial squeezing pulse durations. Each data point 
is calculated from ~10* experiments. The data shown in (B) and (C) have initial |a;| = 0.0578 
(40.0006) and a squeezing duration of t = 8 us [nominally r = 2.54 (+0.03)]. The solid black 
line in (D) is the maximum theoretical contrast without squeezing. Dashed lines in (C) and (D) 
are derived from a numerical model that includes motional decoherence. (E) Measurement 
sensitivity enhancement in the linear small-displacement regime as a function of the ideal 
gain G = exp(gt). For each squeezing duration, the enhancement is determined by dividing 

the slope of the contrast for C < 0.25 [obtained by fitting a straight line to data points in (D) 
with C < 0.25] by the slope of the O dB black line, which represents the standard quantum 


limit (19). Error bars denote SD. 


(+0.03), and ideally 22.0 (+0.3) dB of squeez- 
ing]. Here, we achieved a contrast gain of 7.3 
(+0.3), corresponding to a factor of 53 (+4) re- 
duction in the number of measurements required 
to achieve a given SNR, equivalent to a 17.2 
(+0.3) dB enhancement in measurement sen- 
sitivity. For larger squeezing durations, degrada- 
tion of the contrast due to background motional 
heating and dephasing in our trap prevents a 
further increase in gain. This is not a limitation 
of the amplification process or our squeezing 
method. We note that with amplification, we can 
achieve a SNR of 1 for measuring a displace- 
ment of one Bohr radius (~0.0529 nm, corre- 
sponding to a ~ 0.00467), less than the extent of 
the ground-state vacuum fluctuations (o = 0.5) 
by a factor of 107, in ~200 experiments. 

We have implemented a fast unitary squeezing 
interaction in a simple mechanical oscillator and 
used it to amplify and detect coherent motional 
displacements that are significantly smaller than 
the quantum zero-point fluctuations. This ampli- 
fication technique can enhance measurement 
sensitivity in protocols that use phase-stable dis- 
placements, such as photon-recoil spectroscopy 
(26, 27), where the phase of momentum kicks 
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from photon absorption can be controlled by 
modulating the photon source. Our method can 
be extended to amplify displacements of un- 
known frequency or phase, following the recent 
proposal in (28). The parametric modulation used 
for squeezing can also be combined with a spin- 
dependent force to enhance phonon-mediated 
spin-spin interactions (28, 29), which are used 
to create entanglement in quantum simulation 
and quantum information processing experi- 
ments. Our methods are also applicable to the 
generation of exotic nonclassical motional states 
and to continuous-variable quantum infor- 
mation processing (30). Finally, we note that 
the squeezing, displacement, spin-motion cou- 
pling, and qubit control interactions used in 
this work are all generated without lasers, 
thereby eliminating spontaneous emission, sim- 
plifying control of relative phases, and enabling 
use with other charged particles lacking optical 
transitions such as electrons, positrons, and 
(anti-)protons (23). 
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Photoexcitation of flavoenzymes 
enables a stereoselective 


radical cyclization 


Kyle F. Biegasiewicz'*, Simon J. Cooper’*, Xin Gao’, Daniel G. Oblinsky’, 
Ji Hye Kim’, Samuel E. Garfinkle’, Leo A. Joyce”, Braddock A. Sandoval’, 


Gregory D. Scholes’, Todd K. Hyster’+ 


Photoexcitation is a common strategy for initiating radical reactions in chemical 
synthesis. We found that photoexcitation of flavin-dependent “ene”-reductases changes 
their catalytic function, enabling these enzymes to promote an asymmetric radical 
cyclization. This reactivity enables the construction of five-, six-, seven-, and 
eight-membered lactams with stereochemical preference conferred by the enzyme 
active site. After formation of a prochiral radical, the enzyme guides the delivery 

of a hydrogen atom from flavin—a challenging feat for small-molecule chemical 
reagents. The initial electron transfer occurs through direct excitation of an electron 
donor-acceptor complex that forms between the substrate and the reduced flavin 
cofactor within the enzyme active site. Photoexcitation of promiscuous flavoenzymes 
has thus furnished a previously unknown biocatalytic reaction. 


adical enzymes catalyze radical-mediated 
reactions with exquisite chemo-, regio-, 
and enantioselectivity (1). These catalysts, 
however, exploit strategies for radical for- 
mation that do not translate well out- 
side of the natural setting and thus do not lend 
themselves readily to be harnessed for robust 
synthetic organic chemistry, in which opera- 


allenging Transformation 
A Chall Transformati Cc 
Catalytic Asymmetric Radical Alkene Hydroalkylation 


tional simplicity, generality, and ease of reaction 
execution are desirable (2). To achieve these 
features, it would be attractive to develop bio- 
catalytic reactions that use mechanisms of rad- 
ical formation commonly used in chemical 
synthesis. If these mechanisms were to occur 
within active sites of established enzymatic plat- 
forms known to be substrate promiscuous, gen- 


eral catalysts for asymmetric radical reactions 
could be accessed. Moreover, by repurposing 
known enzymes, desired biocatalytic functions 
can be achieved while retaining characteristics 
such as ease of handling, substrate promiscuity, 
and evolvability already associated with these 
catalysts (3). 

Photoexcitation is widely used in organic syn- 
thesis to facilitate radical reactions (4) but is 
less common in enzyme catalysis. Light has been 
used to activate enzymes through conformational 
changes (5), drive charge separation in multi- 
protein systems (6), and facilitate cofactor turn- 
over or substrate epimerization (7). Direct use 
of photonic energy to drive native, biological re- 
actions is much more limited but has been docu- 
mented for protochlorophyllide reductase (8), fatty 
acid photodecarboxylase (9), and DNA photolyase 
(10). We recently found that nicotinamide- 
dependent ketoreductases (KREDs) and double- 
bond reductases (DBRs) can use photoinduced 
electron transfer to effect asymmetric radical 
hydrodehalogenation and hydrodeacetoxylation 
reactions (11, 12). These examples illustrate that 
irradiation of common oxidoreductases can draw 
out a non-natural reaction mode for catalysis, a 
feature we anticipate can be exploited to address 
challenges in chemical synthesis. 

The stereoselective coupling of electrophilic 
radicals and unactivated alkenes enables the 
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preparation of structural motifs found in agro- 
chemical and pharmaceutical agents (13). Catalytic 
strategies for rendering this type of transforma- 
tion stereoselective remain elusive (14). This is 
primarily because of the challenge of maintaining 
association between radical species and chiral cat- 
alysts throughout the stereoselectivity-determining 
step and the lack of reagents capable of supplying 
a hydrogen atom to one face of a prochiral rad- 
ical (Fig. 1A) (15). Given these challenges, we imag- 
ined that enzymes would be an ideal catalyst 
for this transformation because of their ability to 
precisely control the conformational landscape 
of reactive species. As a model for this family 
of reactivity, we targeted the development of a 
biocatalytic radical cyclization of o-chloroamides 
to afford B-stereogenic lactams (Fig. 1B). The 
lactam motif is prevalent in medicinally valu- 
able molecules (16), and the proposed synthe- 
sis would be distinct from existing biocatalytic 
approaches for generating N-heterocycles (17). 
Although this cyclization is well known in the 
radical literature, it is plagued by preferential 
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formation of the hydrodehalogenated and oligo- 
merized product, and there are no known cat- 
alytic asymmetric variants (18, 19). New enzymatic 
Csp®-Csp* bond-forming reactions are desired 
in biocatalysis (20, 27). 

We looked to flavin-dependent “ene”-reductases 
(EREDs) as a potential catalyst family for the 
proposed cyclization of a-chloroamides. Because 
EREDs have large, substrate-promiscuous active 
sites, we hypothesized that these enzymes would 
be able to bind the substrate in a conformation 
to enable cyclization (22, 23). Moreover, our re- 
cent studies demonstrated that these enzymes 
are able to control the stereochemical outcome 
of radical hydrodehalogenation reactions (24). 
Unfortunately, flavin hydroquinone (FMNnpq) is a 
modest single-electron reductant [Ey2 = -0.45 V 
versus saturated calomel electrode (SCE)], mak- 
ing electron transfer to a-chloroamides (Epy2"* = 
-1.65 V versus SCE) thermodynamically challeng- 
ing (fig. S23). The excited state of the flavin hy- 
droquinone (FMN),*) (Ey2* = -2.26 V versus SCE), 
however, should be capable of accomplishing 
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this initial electron transfer (25). Previous work 
has investigated the fundamental photophysics 
of the FMN,,* in flavin-dependent enzymes 
(26, 27). If this excited state could be used for 
the envisioned chemistry, it would transform 
flavin-dependent enzymes into chiral photo the 
envisioned chemistry, substantially expanding 
their synthetic utility. 

We began by testing the cyclization of a- 
chloroacetamide 1 to afford y-lactam 2 under 
visible light irradiation (table S1). An ERED 
from Gluconobacter oxydans (GluER) was effec- 
tive when irradiated with near-ultraviolet (UV) 
light (390 nm), furnishing the desired cyclization 
product with modest yield and enantioselectiv- 
ity (table S2). Optimization of the lighting source 
revealed cyan light (497 nm) to provide product 
with improved yield and enantioselectivity while 
producing less than 2% of hydrodehalogenation 
product (Fig. 1C, entry 1). The opposite enantiomer 
is favored by Old Yellow Enzyme 1 (OYE}) (Fig. 
1C, entry 2). Control experiments confirm that 
light, ERED, nicotinamide adenine dinucleotide 
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phosphate (NADP*), glucose, and glucose dehy- 
drogenase (GDH-105) are required for reactivity 
(table S1). To increase the activity of GluER for 
this non-natural function, we conducted mutage- 
nesis and found that mutation of T36, a residue 
located at the surface of the protein, to alanine 
(T36A) furnished improved yields of product and 
only trace quantities of the hydrodehalogenated 
product (Fig. 1C, entry 3). We solved crystal 
structures of GluER and GluER-T36A and found 
no differences in the structure that would explain 
the improved yield (backbone root mean square 
deviation of 0.53 A) (figs. S48 to $52). This mu- 
tation does not have a detrimental effect on the 
native function of this enzyme (fig. S45) and 
allows the catalyst loading to be decreased to 
0.5 mole % (mol %) without compromising yield 
or enantioselectivity (fig. S46). Moreover, this 
reaction can be conducted with lyophilized cell- 
free lysate. This feature enabled the model cy- 
clization to be run on gram scale, providing 
product with no change in yield or enantiose- 
lectivity (Fig. 1C, entry 4). The reaction could be 
run open to air with slow addition of glucose 
over 8 hours, providing comparable yields and 
enantioselectivities to reactions run under 
inert atmosphere (Fig. 1D). Alternatively, GDH- 
105, NADP”, and glucose could be replaced with 
periodic addition of sodium dithionite and de- 
gassed buffer; when run open to air, there was 
no observable change in the reaction outcome 
(Fig. 1D). 

With optimized conditions in hand, we ex- 
plored the scope and limitations of this reaction 
(Fig. 2). Aromatic substituted alkenes are tol- 
erated for 5-exo-trig cyclizations, affording product 
in high yield and enantioselectivity with substit- 
uents at the para-, meta-, and ortho positions 
(Fig. 2, 8-12). The electronic characteristics of 
these substituents had only a modest effect on 
reaction efficiency. Alkyl substituents on the 
olefin are also tolerated, affording product in 
nearly quantitative yield in all cases, albeit with 
diminished enantioselectivities (Fig. 2, 17-20). 
Because this mode of reactivity should be ac- 
cessible across the entire ERED family, we 
suspected that other members might provide 
improved levels of enantioselectivity. An ERED 
from Nostoc sp. (NostocER) accordingly pro- 
vided improved enantioselectivities for the more 
sterically demanding alkyl-substituted substrates 
(Fig. 2, 19, 20). 

We next shifted our attention to other cycli- 
zation modes. GluER-T36A catalyzes a 5-endo-trig 
cyclization, in which the stereogenic center is 
created by means of hydrogen atom transfer 
(HAT) to yield the 5-substituted y-lactam (Fig. 2, 
22). The ERED is thus capable of controlling the 
delivery of a hydrogen atom to sites distal to the 
carbonyl, a feat largely unknown in the small 
molecule literature (10, 28). A 6-exo-trig cycliza- 
tion to furnish 6-lactams occurs with GluER-T36C 
in modest yield and enantioselectivity. Improved 
levels of enantioselectivity and yield can be 
achieved by using a variant of Morphinone re- 
ductase (MorB-Y72F) (Fig. 2, 24). GluER-T36A can 
also catalyze a 7-exo-trig cyclization (Fig. 2, 26), 
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albeit with increased catalyst loadings. Last, an 
8-endo-trig cyclization is also possible by using 
GluER-T36A, affording product in high yields 
but with essentially no enantioselectivity. An 
ERED from Lactobacillus casei (LacER) provides 
significantly improved levels of enantioselec- 
tivity but slightly diminished yields, with the 
remaining mass balance being unreacted start- 
ing material (Fig. 2, 28). The latter examples 
are particularly exciting because they are under- 
represented cyclization modes in the small mol- 
ecule literature (29). 

Inspired by the selectivity of the HAT event, 
we considered the possibility of ERED-controlled 
hydrogen atom delivery to exocyclic prochiral 
radicals. Because these radicals are conforma- 
tionally flexible, controlling HAT is challenging. 
We began by testing substrates that possess tri- 
substituted alkenes for the 5-exo-trig cyclization 
mode. Substrates that contain phenyl/methyl and 
phenyl/ethyl substituents at the terminal posi- 
tion of the olefin afforded lactams with excel- 
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lent enantio- and diastereoselectivities (Fig. 2, 
30 and 32). Because these substrates provide 
an equimolar mixture of diastereomers when 
prepared with photoredox catalysts, there does 
not appear to be a thermodynamic preference 
for one diastereomer, demonstrating that the en- 
zyme is responsible for controlling the delivery of 
the hydrogen atom. The cyclohexyl/Et substituted 
alkene substrate 33, which lacks an aromatic 
group, was also a substrate for this reaction, 
with diastereoselectivity only slightly diminished 
(Fig. 2, 34). Substrates that contain substituents 
with steric bulk may thus be ideal for achieving 
high levels of diastereoselectivity. 

We conducted a series of initial rate exper- 
iments to better understand how the kinetic 
profile of the reaction compares with native al- 
kene reduction. We found that the model reac- 
tion follows Michaelis-Menten kinetics with a 
Michaelis constant (K,,) = 9.7 mM, an apparent 
unimolecular rate constant (4a) = 0.012 s, and 
a Keat/Km of 0.0012 mM s™ (fig. $27). On the 
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basis of our observation that increased light 
intensities afford increased rates and higher 
yields, we suspect that the decreased Ka, is 
due to the reaction being photon limited (fig. 
$28). We observed a quantum yield of 7.8%. 

We conducted a series of mechanistic experi- 
ments to better understand the nuances of this 
reaction, including the superior reactivity with 
cyan light-emitting diodes (LEDs) [peak max- 
imum (Amax) = 497 nm]. Reduction of GluER 
to the FMNj, oxidation state with dithionite 
revealed formation of the diagnostic FMNpgq 
signature with minimal absorption at 497 nm, 
suggesting that direct excitation of the cofactor 
is not responsible for the observed wavelength 
preference (Fig. 3A, FMNpq). Addition of 100 
equivalents of chloroamide 1 resulted in for- 
mation of a broad absorption band at Anax = 
500 nm (Fig. 3A, FMNnq + 1). Because this 
absorption feature is not lost upon addition of 
sodium dithionite, we do not attribute it to 
FMNgyq. It is lost, however, upon addition of 
150 equivalents of sodium benzoate (Fig. 3A, 
FMN), + 1 + NaOBz). These data are consist- 
ent with the formation of an electron donor- 
acceptor complex between the substrate and 
FMN),, within the enzyme active site. We pro- 
pose that excitation of this charge transfer band 
promotes the initial electron transfer from 
FMN), to 1. 

Next, we sought evidence for the existence 
of radical intermediates by preparing a sub- 
strate containing a cyclopropyl ring. GluER-T36A 
produced only products of cyclopropane ring 
opening, supporting the existence of a radical 
intermediate (fig. S9). To determine the terminal 
hydrogen atom source, we used labeled FMNDhgq 
generated in situ from D-glucose-1-d, (Fig. 3A). 
Deuterium was incorporated exclusively at the 
benzylic carbon, with a high level of diaster- 
eocontrol. These experiments support a reac- 
tion mechanism in which substrate is initially 
reduced by one electron after irradiation of 
the electron donor-acceptor complex formed 
between the substrate and FMNj, within the 
enzyme active site. The o-acyl radical can react 
with the pendent olefin to afford an exocyclic 
radical that is terminated through hydrogen 
atom transfer from neutral flavin semiquinone 
(FMN,,) to afford the product and oxidized 
flavin (fig. S47). 

On the basis of this mechanistic hypothesis, 
we reasoned that the configuration of the alkene 
may be responsible for the observed levels of di- 
astereoselectivity if HAT is faster than rotation 
of the exocyclic C-C bond. We thus performed 
the reaction on starting material 29 with (Z)- 
alkene geometry rather than the (E)-isomer. 
Both alkene geometries favor the same diaster- 
eomer and produce no change in the enantio- 
selectivity (Fig. 3C). The enzyme thus favors HAT 
from one rotamer of the prochiral radical at rates 
that are competitive with bond rotation. The di- 
minished levels of diastereoselectivity observed 
with the (Z)-alkene isomer are presumably due 
to a small degree of hydrogen atom transfer be- 
fore bond rotation. 
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Transient absorption spectroscopy with olefin 
isomers of 29 provides additional support for 
this mechanistic hypothesis. Initial excitation 
at 370 nm followed by a broad-band probe pulse 
reveals underlying dynamics of the flavin redox 
states. We used a sequential kinetic scheme to 
fit the data through global analysis. We found 
three time constants that describe the tempo- 
ral evolution of the various flavin species through 
evolutionary associated difference spectra (EADS). 
When (E)-29 is used as the substrate, a signal 
corresponding to the charge transfer state de- 
cays in 10 ps to the FMN,,. This time scale is 
consistent with previously reported rates of de- 
composition for the a-chloroacetophenone ketyl 
radical anion (30). The neutral FMNsq decays on 
the time scale of 250 ps to the flavin quinone 
(FMN,,) (figs. S42 and S43). This decay likely 
corresponds to the rate of cyclization and termi- 
nation of the radical through hydrogen atom 
transfer from FMN,,. When the same experiment 
is conducted on (Z)-29, the FMN,, lifetime in- 
creases to 700 ps (figs. S39 and S40). These data 
are consistent with post-cyclization rotation of 
the exo-cyclic radical to a conformation in which 
HAT from the FMN,, to the substrate-centered 
radical is kinetically facile. 

We anticipate that the light-promoted reac- 
tivity demonstrated here will be replicable 
widely in EREDs, KREDs, and other classes of 
oxidoreductases known in biocatalysis for their 
promiscuity and adaptability. These flavoen- 
zymes may serve as stereoselective catalysts for 
unexpected radical transformations beyond 
those demonstrated here, depending on the 
active site organization, provided starting mate- 
rial, and properties of the excited state flavin. 
By more widely investigating and exploiting 
photoexcited states of cofactors, it should be 
possible to photoinitiate radical-based reactions 
within enzymes that are otherwise inaccessible 
in the ground state. 
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RADIOTRACER CHEMISTRY 


Direct arene C-H fluorination with 
'SF" via organic photoredox catalysis 


Wei Chen™, Zeng Huang™, Nicholas E. S. Tay”*, Benjamin Giglio’, Mengzhe Wang’, 
Hui Wang’, Zhanhong Wu’, David A. Nicewicz”+, Zibo Li'+ 


Positron emission tomography (PET) plays key roles in drug discovery and development, 
as well as medical imaging. However, there is a dearth of efficient and simple radiolabeling 
methods for aromatic C—H bonds, which limits advancements in PET radiotracer 
development. Here, we disclose a mild method for the fluorine-18 (#8F)-fluorination of 
aromatic C-H bonds by an [{8F]F~ salt via organic photoredox catalysis under blue light 
illumination. This strategy was applied to the synthesis of a wide range of 1°F-labeled 
arenes and heteroaromatics, including pharmaceutical compounds. These products can 
serve as diagnostic agents or provide key information about the in vivo fate of the labeled 
substrates, as showcased in preliminary tracer studies in mice. 


luorine-18 (‘3F) is one of the most impor- 

tant radioisotopes in the radiopharmaceutical 

industry because it has a relatively long 

half-life (4/2 = 110 min) and decays with 

high efficiency by positron emission (97%) 
(1). A primary application of this radioisotope 
is in the form of 2-['*F]fluorodeoxyglucose 
((3FJFDG), which is used for oncological diag- 
noses, neuroimaging, and studying glucose metab- 
olism (2). Uptake of ['°F]FDG and other F-labeled 
agents is monitored by positron emission tomog- 
raphy (PET) imaging, which quantifies spatial 
distributions and metabolic perturbations, as 
well as site-specific chemical reactivity and en- 
suing in vivo biological processes (3). 

Many small-molecule pharmaceuticals and 
therapeutics contain aromatic or heteroaromatic 
systems within their framework, thus presenting 
a common organic subunit for the installation of 
radioisotopes to yield radiotracers with imaging 
utility. In particular, the direct conversion of 
arene C-H into C-""F bonds is ideal owing to the 
prevalence of aromatic C-H bonds and the in- 
creasing importance of C(sp”)-F bonds in small- 
molecule therapeutics and probes (4, 5). Direct 
8F-fluorination of aromatics currently requires 
the use of electrophilic fluorine sources, the sim- 
plest of which is ['°F]F,; however, this gaseous 
reagent is incompatible with many common or- 
ganic functional groups and suffers from low 
molar activity (the measured radioactivity per 
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mole of compound) as a result of its production 
methods (6-8). Most modern methods for C-H to 
C-F bond conversion require electrophilic fluorine 
in the form of relatively expensive but bench-stable 
reagents such as N-fluorobenzenesulfonimide 
(NFSI) and 1-chloromethyl-4-fluoro-1,4-diazo- 
niabicyclo[2.2.2 octane-bis(tetra-fluoroborate) 
(Selectfluor). However, their utility for SF radio- 
labeling via electrophilic fluorination is dimin- 
ished because [‘°FIJNFSI and [°F ]Selectfluor are 
prepared from ['°F]F» (9), which results in even 
lower molar activities of the F-labeled tracers. 
Therefore, a method for direct conversion of a 
C-H to a C-'*F bond via a nucleophilic arene 
fluorination strategy is highly attractive because 
the synthesis of 'F-radiolabeled pharmaceu- 
tical compounds can be accomplished with high 
molar activity fluoride ((’F]F ). Furthermore, 
direct C-H to C-'°F conversion should alleviate 
synthetic burdens associated with the synthesis 
of radiotracer precursors (JO) and allow for the 
recovery of precious, unreacted starting material. 
However, the primary hurdle for this approach is 
the lack of reactivity for a majority of simple 
aromatics with [°F]F-. 

Nucleophilic aromatic substitution of electron- 
deficient arenes has been the standard method 
for [°FJF- incorporation (1), but prefunctionali- 
zation of the aromatic subunit with electron- 
withdrawing groups is required for this strategy 
(3). Thus, modern radiofluorination methods 
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have sought to generalize the arene scope for 
this transformation. Current strategies include 
18F-deoxyfluorination of phenols via uronium 
(11) and N-arylsydnone (72) intermediates; dis- 
placement of sulfonium salts (73); fluorodemetal- 
ation of preformed palladium or nickel arene 
complexes from the requisite aryl halides or 
boronic acids (J4, 15); and copper-mediated 
cross-coupling of preformed or in situ-generated 
aryliodoniums (J6, 17), aryl boronic acids (18), 
esters (19), and arylstannanes (20) (Fig. 1A). De- 
spite the advances in radiofluorination, these 
approaches can be technically challenging for 
radiochemists. Moreover, the use of metal re- 
agents, especially in stoichiometric or super- 
stoichiometric amounts, can complicate the 
quality control process for translational studies 
because additional analysis on residual metal 
levels is required. 

With these challenges in mind, we sought to 
develop an arene C-H fluorination method com- 
patible with [°FJF- (Fig. 1B). The Nicewicz lab 
has developed a research program using organic 
single-electron photooxidants to catalytically gen- 
erate arene radical cations as reactive intermedi- 
ates for arene C-H functionalization reactions. 
Thus, we applied this strategy for direct C-H to 
c-'*F bond conversion. Considering the low 
molar amount of no-carrier-added [°F]F-, radi- 
ation exposure, and the potentially high cost of 
[’FIF production, we decided to first use !F- 
fluoride for the development of a photoredox- 
catalyzed method for arene C-H fluorination. 
Using diphenyl] ether as the aromatic substrate, 
we were able to obtain 17% of fluorinated adducts 
in a 13:1 para:ortho ratio using an acridinium- 
based photooxidant (1), cesium fluoride (CsF), 
the phase-transfer reagent tetrabutylammonium 
bisulfate (TBA-HSO,) and 2,2,6,6-tetramethyl-1- 
piperidine 1-oxyl (TEMPO) as a redox co-mediator 
under aerobic conditions for 24 hours (Fig. 2 
and table S1). We attribute the low yields to the 
relative recalcitrance of fluoride, which is a 
Bronsted base and can form strong hydrogen 
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Fig. 1. An organic photoredox approach to /*F labeling of arenes for PET studies. (A) Prior work relies on the isolation of aryl organometallic 
species. (B) This study obviates the need for metalation by using organic photoredox catalysis. 
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Fig. 2. Reaction scope of !°F-fluorination of aromatics. All RCYs are 
decay-corrected and averaged over three experiments unless otherwise 
noted. Asterisk indicates yield averaged over five experiments. Single-dagger hex, hexyl; iPr, isopropyl; t-Bu, tert-butyl; Me, methyl; Ph, phenyl: OTf, 
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formed. Double-dagger symbol indicates that the RCYs listed are based on 
the product deprotection yields (see SM5.6). Bu, butyl; MeCN, acetonitrile; 
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bonds in aqueous environments (2/). Despite 
lower yields for the aryl C-H fluorination with 
1F~ C-H fluorination was feasible and thus we 
decided to extend our method to radiofluorination. 

In this new system, [‘°F]JF~ is employed as the 
limiting reagent, and we envisioned that the 
relatively higher concentration of arene radical 
cation would efficiently capture ['SF]F-. Tran- 
sitioning from F- to [°FJF” necessitated a re- 
examination of fluoride sources and irradiation 
method while aiming to achieve reasonable 
18F-_labeling yields on 30 min to 1 hour time scales, 
given the fleeting half-life of the fluorine radio- 
isotope. High molar activity aqueous ["SF]F” is pre- 
pared via proton bombardment of ['*O] water 
and subsequent elution of ““F-fluoride with tetra- 
butylammonium bicarbonate to yield ['SF]TBAF, 
or potassium carbonate complexed with the 
aminopolyether cryptand Kryptofix 2.2.2 to form 
[SFIKF-Ky»» (1). In both of these systems, excess 
tetrabutylammonium bicarbonate and potassium 
carbonate are present. Although the initial radio- 
chemical yields (RCYs) were relatively low (0.57%), 
['SFJTBAF was the most effective ““F-fluorinating 
agent in the synthesis of [‘°F]2 [see section 5.5 
of the supplementary materials and methods 
(SM5.5)]. Initial RCYs using 455-nm light-emitting 
diodes were relatively low (0.57%), which prompted 
us to reevaluate the light flux for the transfor- 
mation. Using top-down irradiation with a 3.5-W 
laser (450 nm) (22) for 30 min (cooled to 0°C to 
prevent solvent loss from laser-generated heat) 
under an aerobic atmosphere afforded a marked 
increase in the yields of the F-fluorinated ad- 
ducts of diphenyl ether (25.8 and 2.0% for the 
para and ortho adducts, respectively). After ex- 
tensive optimization (SM5.5), we identified a sys- 
tem that afforded the para and ortho fluorinated 
adducts (37.1 and 2.0% yield, respectively) but 
observed a drop in the molar activity of ['°F]- 
TBAF. We attributed this inconsistency to the 
exchange of ['°F]F~ with fluoride from the BF, 
counterion of the acridinium catalyst, which in 
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Fig. 3. Examples of PET tracers synthesized via arene C—H 
radiofluorination. (A) Maximum intensity projection (MIP) PET images 
of ['®F]-fenoprofen (42) demonstrate higher uptake in TPA-treated 
mouse ear (A-1) compared with control (A-2) mouse ear. (B) PET/CT 
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turn lowers the net molar activity of [SF]-TBAF. 
To circumvent this problem, we synthesized an 
acridinium containing a perchlorate (ClO, ) coun- 
terion and found that “F-labeled diphenyl ether 
(2) adducts could be isolated with a molar ac- 
tivity of 1.37 Ci/umol (compared with 0.39 Ci/umol 
with BF, counterion) and comparable RCYs of 
38.2 and 1.8% for the para and ortho products, 
respectively. 

Having identified the optimal catalytic con- 
ditions for the transformation, we turned our 
attention to evaluating the scope of this method 
with a range of aromatic and heteroaromatic 
substrates. Biphenyl (8) and naphthalene (4) 
were fluorinated at the 4- and 1-positions, re- 
spectively, in good RCY. 2-Substituted methox- 
yarenes (5 to 11) were also efficiently fluorinated 
at the C-H site para to the methoxy group, fol- 
lowing a similar trend to the amination (23) 
and cyanation (24) methods from our laboratory. 
Halogenated and pseudohalogenated methoxy- 
arenes 2-bromoanisole (5), 2-chloroanisole (6), 
and 2-OTf-anisole (7) were all fluorinated at the 
4-position in good RCY, thus demonstrating the 
compatibility of halogenated and pseudohalo- 
genated arenes with our system, as these func- 
tional groups are typically susceptible to oxidative 
addition in transition metal-catalyzed meth- 
ods. A range of carbonyl-containing functional 
groups, such as esters (8), ketones (9), nitriles 
(10), aldehydes (11), and amides (12), were all 
compatible with this method, affording sin- 
gle regioisomers of the ‘*F adducts. Methoxy- 
substituted biphenyl systems (13 and 14) were 
also competent substrates for fluorination, af- 
fording the single regioisomers in moderate 
RCY. We did not observe significant ortho fluo- 
rination for 2-substituted anisoles, which we 
attribute to a greater gain in positive charge 
density on the para position of the arene cation 
radical (25) and potential steric hindrance. Thus, 
we were curious to explore the amenability of 
our radiofluorination protocol to methoxyarenes 
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bearing substitution at the para position. We 
found that these 4-substituted congeners (15 to 
19) were compatible fluorination partners, al- 
though the RCYs observed were significantly 
lower than their 2-substituted counterparts. 
Nevertheless, the isolated yields for these 4- 
substituted methoxyarenes are acceptable for 
PET imaging applications. Preliminary compu- 
tational studies (figs. S97 and S98) suggest that 
there is little correlation between the calculated 
electrophilicities of the cation radical for the 
2- and 4-substituted arenes and the observed 
yields, thus suggesting a greater contribution 
of steric effects in determining site selectivity. 
When 3-methoxyacetophenone was subjected 
to the radiofluorination conditions, a 2:1 ratio 
of '$F-labeled adducts was obtained with a pref- 
erence for fluorination para to the methoxy 
group (20). Computational studies suggest that 
the 6-position gains the most electrophilic char- 
acter relative to the 2- and 4-positions (fig. S99) 
upon single electron oxidation; however, only 
the latter products are observed. Taken together, 
these results demonstrate a strong preference 
for the para functionalization of methoxyarenes 
except in cases where the para position is oc- 
cupied, and they suggest that steric effects may 
play a role in dictating site selectivity for C-H 
fluorination. 

Arenes containing guaiacol motifs are found 
in numerous plant and animal metabolites; thus, 
the application of our C-H radiofluorination strat- 
egy could enable the facile synthesis of “F radio- 
tracers from renewable sources. We found that 
ethylguaiacol (21), vanillylamine (22), noniva- 
mide (23), and zingerone (24) derivatives were 
all successfully fluorinated to provide single regio- 
isomers of the ‘SF analogs. Furthermore, this 
reaction is not limited to methoxyarenes, as dem- 
onstrated by mesitylene undergoing fluorination 
(<9%) under aerobic conditions to give 25. This 
isolated RCY was improved to 50% by employing 
a modified anaerobic system (using TEMPO as a 
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net oxidant in MeCN with N, bubbling). Fluo- 
rinated heterocycles are privileged and highly 
desirable motifs in pharmaceutical and agro- 
chemical research, and substantial resources are 
directed toward measuring the pharmacokinetics 
of these compounds (26). Our radiofluorination 
protocol was applied to several heterocyclic clas- 
ses: We found that 2,5-dimethoxypyridine (26), 
2-chloro-6-methoxyquinoline (27), and N,N- 
dihexylquinazolinedione (28) were all success- 
fully fluorinated in good to moderate RCY. 
Benzazoles common to many therapeutics, such 
as N-methylindazole (29), benzoxazole (30), and 
benzimidazole (31), all underwent fluorination 
at the most electrophilic positions of their re- 
spective cation radicals. Selective late-stage arene 
C-H fluorination is an attractive synthetic strategy, 
as it circumvents the need for complicated labeling 
precursors and enables the straightforward con- 
version of bioactive molecules and drugs into PET 
agents for in vitro companion diagnosis or for 
pharmacodynamic and pharmacokinetic studies. 

We chose to apply our "$F radiofluorination 
method to several nonsteroidal anti-inflammatory 
drugs (NSAIDs), which are an important class of 
pharmaceuticals that alleviate pain and inflam- 
mation by inhibiting the activity of cyclooxygenase 
enzymes (COX-1 and COX-2). Although there 
have been recent advances in the radiolabeling 
of COX-1 and COX-2 inhibitors, many examples 
use “C as the radionuclide, which has the dis- 
advantage of a shorter half-life (4/2 = 20.2 min) 
than °F (27). Existing ““F-labeled COX inhibitors 
typically incorporate 'F in the radioprobe as 
part of a phenol-appended fluorinated alkyl 
chain (28). However, these functional groups are 
prone to metabolic degradation and thus may be 
less-effective radiotracers (27, 29). Fluorination 
of the aromatic ring is a strategy typically used to 
study drug metabolism, as fluorine is a hydrogen 
bioisostere and its substitution slows the meta- 
bolic degradation of drug molecules by cyto- 
chrome P450 (29). Fluorinated aromatics can 
also act as metabolic tracers because hydrox- 
ylated fluoroarene metabolites undergo a 1,2- 
fluoride shift (NIH shift), thus allowing for the 
detection and quantification of metabolic by- 
products (30, 32). Furthermore, the introduction 
of fluorine into the aromatic system can improve 
the potency and cell permeability of drug mol- 
ecules through noncovalent interactions (5). The 
development of the NSAID celecoxib is an in- 
structive example, in which the substitution of 
various aryl C-H bonds for aryl C-F bonds was 
used to bias in vitro COX-2 selectivity (32). 
However, routes for the analogous synthesis of 
C(sp”)-'*F bonds in COX inhibitors with aro- 
matic moieties are underexplored because of dif- 
ficulties with designing late-stage precursors for 
18F radiolabeling (33). Thus, we envisioned that 
our method would enable the introduction of °F 
into known COX inhibitors. The NSAID deriva- 
tives fenoprofen methyl ester (32), flurbiprofen 
methyl ester (33), and O-methyl methyl salicy- 
late (8) were all fluorinated in good to moderate 
RCYs. Given the ubiquitous use of these commer- 
cial NSAIDs (34), the synthesis of their radio- 
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tracer counterparts could provide researchers 
with a method for visualizing their immediate 
in vivo metabolic fates that is complementary 
to the longitudinal metabolism studies enabled 
by ?H- and “C-labeling strategies (35). 

The hypolipidemic agents clofibrate (34) and 
fenofibrate (35), as well as a derivative of the 
biological neurotransmitter precursor DL-DOPA 
(36), were selectively fluorinated in moderate RCY 
after 30 min. We found that the fluorination 
protocol was influenced by reaction times, as 
extending the runtime to 1 hour increased the 
RCY of the fluorinated DOPA derivative to 21.2%. 
This result is especially noteworthy because 
['SF]DOPA is an important radioprobe for the 
PET imaging of CNS disorders (36), but pub- 
lished routes to it typically require extensive 
and sensitive synthesis with ‘SF precursors (37) 
or the fluorination of prefunctionalized DOPA 
analogs (J6, 19). This fluorinated DOPA deriv- 
ative was then subjected to facile global depro- 
tection to yield ['SF]-DOPA (37) in 12.3% RCY. 
Other aromatic amino acids, such as the pro- 
tected variants of O-Me-ortho-tyrosine and 4- 
phenyl-phenylalanine, were also successfully 
radiofluorinated (38 and 40, respectively), and 
their deprotected forms (39 and 41, respectively) 
were accessed with relative ease. 

A major objective of our synthetic methodol- 
ogy was to develop clinically relevant PET tracers 
from readily available bioactive molecules with- 
out first requiring arene prefunctionalization. To 
test this concept, we examined the feasibility of 
converting the NSAID fenoprofen into a PET 
agent by direct C-H fluorination. Fenoprofen 
has notable anti-inflammatory activity (38), but 
there have been minimal studies with its fluo- 
rinated analogs. Given the widely exploited hy- 
drogen bioisosterism of fluorine in medicinal 
chemistry (5), we were interested in examining 
the viability of the radiofluorinated analog for 
PET studies. ['°F]-Fenoprofen (42) was readily 
accessed from 32, which was then used to detect 
inflammation induced by 12-0-tetradecanoylphorbol- 
13-acetate (TPA) in mouse ears (Fig. 3A and fig. 
S94) (see supplementary materials and methods 
for more details) (39). Preliminary ex vivo PET 
studies (Fig. 3A) show significantly higher up- 
take of [°F ]-fenoprofen in the ear inflammation 
model relative to the control 30 min after in- 
travenous introduction of the radioprobe. These 
data suggest that 42 is a potential PET agent that 
demonstrates preferential accumulation in in- 
flamed tissue. Additional biological evaluations 
for 42 are needed but are currently beyond the 
scope of this Report. 

Another application of our C-H radiofluorina- 
tion method is rapid radioligand screening in 
drug discovery and development. We chose to 
highlight synthetic aromatic amino acids as a 
class of bioactive metabolites owing to their 
applicability for oncological PET imaging (40-42). 
We are especially interested in the tyrosine scaffold, 
as fluorination on the aromatic ring is an impor- 
tant functionalization mode for developing PET 
probes. We selected O-Me-ortho-tyrosine and 
4-phenyl-phenylalanine as the working examples, 


and our method provides facile access to radio- 
fluorinated 39 and 41. In vivo PET studies of 
mice containing MCF7 (breast cancer) and 
U87MG (glioblastoma) tumor xenografts dem- 
onstrate prominent uptake of 39 with minimal 
accumulation in other organs except the pancreas 
and bladder (Fig. 3B and fig. S95) (see supple- 
mentary materials and methods for more details). 
Conversely, 41 displayed low uptake in similar 
tumor models with significantly higher reten- 
tion in the mouse circulatory system. On the basis 
of these preliminary studies, 39 shows promise 
as a Selective amino acid radioprobe for tumor 
detection, and further studies will be needed to 
examine its biological activity and pharmacology. 
These results further demonstrate the potential 
of our radiofluorination method for the discov- 
ery of new PET agents that circumvents the need 
for prefunctionalized (hetero)arenes. 

18F radiolabeling is an important tool for 
noninvasive studies of biological systems, and 
we anticipate that the applicability of our radio- 
fluorination method to commercial pharmaceu- 
ticals and metabolites will enable direct access to 
new classes of translationally relevant °F radio- 
tracers, either as diagnostic agents or as target 
probes for elucidating the in vivo fate of metab- 
olites or pharmaceuticals. 
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COLLOIDAL MATERIALS 


Particle analogs of electrons 


in colloidal crystals 


Martin Girard””***, Shunzhi Wang”’**, Jingshan S. Du”**, 
Anindita Das®***, Ziyin Huang’, Vinayak P. Dravid’*, Byeongdu Lee’, 
Chad A. Mirkin”**+, Monica Olvera de la Cruz?”***+ 


A versatile method for the design of colloidal crystals involves the use of DNA as 

a particle-directing ligand. With such systems, DNA-nanoparticle conjugates are 
considered programmable atom equivalents (PAEs), and design rules have been 
devised to engineer crystallization outcomes. This work shows that when reduced 
in size and DNA grafting density, PAEs behave as electron equivalents (EEs), 
roaming through and stabilizing the lattices defined by larger PAEs, as electrons do 
in metals in the classical picture. This discovery defines a new property of colloidal 
crystals—metallicity—that is characterized by the extent of EE delocalization and 
diffusion. As the number of strands increases or the temperature decreases, the 
EEs localize, which is structurally reminiscent of a metal-insulator transition. Colloidal 
crystal metallicity, therefore, provides new routes to metallic, intermetallic, and 


compound phases. 


he interactions among electrons and atoms 

to form molecules and materials are foun- 
dational in physics and chemistry. How- 
ever, in the science of colloidal crystals, in 
which particles are often analogized with 
atoms, a particle analog to electrons has not 
been invoked, despite the synthesis of hundreds 
of colloidal crystals (7-24) and the development 
of certain approaches into elaborate forms of 
crystal engineering (9-27). In particular, colloi- 
dal crystal engineering with DNA has led to the 
design of structures with diverse symmetries, 
lattice parameters, and crystal habits (12-22). 
However, to date, the particles modified with 
DNA that define such structures behave as pro- 
grammable atom equivalents (PAEs) and have 
fixed particle positions at set stoichiometric ra- 
tios. These systems are governed by a set of de- 
sign rules and the complementary contact model 
(CCM) (74), the premise that they organize them- 
selves to maximize contacts that lead to hybrid- 
ization and structures such as ionic compounds. 
We report on an electron-atom duality analog 
in colloidal crystal engineering with DNA, in 
which the resulting colloidal assemblies are bet- 
ter classified as “metallic” structures. In such struc- 
tures, small DNA-functionalized NPs become 
mobile and “electron-like” [or electron equiv- 
alents (EEs)] and are essential for maintaining 
the positions of the larger PAE “atom” components. 
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Mixtures of complementary DNA-functionalized 
nanoparticles (NPs) that vary in size and DNA 
surface density were assembled and character- 
ized by means of electron microscopy, synchro- 
tron small-angle x-ray scattering (SAXS), and 
scale-accurate molecular dynamics (MD) sim- 
ulations with explicit hybridization (17, 25, 26). 
Through a combination of theory, simulations, 
and experiments, we show that small particles 
grafted with low numbers of DNA strands (for 
example, <6), when mixed with complementary 
functionalized NPs (Fig. 1A), form crystals but 
do not occupy specific lattice sites and diffuse 
through the crystal in a manner reminiscent of 
classical electrons in metals, as described by 
the original Drude model. The PAEs alone will 
not form crystals because they are almost solely 
repulsive. The delocalized EEs that move freely 
through the lattice are responsible for stabiliz- 
ing it, a type of bonding more reminiscent of 
metals than ionic compounds (Fig. 1B). 

Furthermore, when the interactions are tailored 
by increasing the number of potential DNA 
bonds or lowering the temperature, these EEs 
condense into specific locations, yielding a tran- 
sition akin to a metal-insulator transition. Last, 
by taking advantage of this duality and the struc- 
tural features of the DNA-modified particles 
that govern it, we realized three polymorphic 
crystal phases—body-centered cubic (bcc), face- 
centered cubic (fcc), and Frank-Kasper A1l5— 
and analyzed the distribution and diffusion of 
the particles (EEs and PAEs) within them as a 
function of temperature and number of link- 
ers per EE. 

In a typical set of experiments, 10-nm-diameter 
Au NPs were densely modified with single-stranded 
propylthiolated DNA to yield conjugates with 
~160 strands per NP. These modified NPs were 
hybridized with a complementary strand to 
form a rigid duplex region (18 bases) with a six- 
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base single-stranded overhang (fig. S1). NPs 
with average diameters of 10, 5, 2, and 1.4 nm, 
respectively, were modified in a similar manner 
but with a second type of complementary DNA 
overhang (Fig. 1A). The average number of DNA 
overhanging strands available for bonding (num- 
ber of linkers per EE) is a function of input linker 
concentrations in the solution (Fig. 1F). For the 
first three combinations of NPs (10 + 10, 10 + 5, 
and 10 + 2 nm), all formed the expected CsCl 
lattice (space group Pm3m) (27) based on the 
conventional CCM model and the description 
of them as ionic compound analogs (14). 

However, with the 10 + 1.4 nm combination, 
the 10-nm NPs assumed a bcc lattice (space group 
Im3m), but the 1.4-nm NPs were invisible to SAXS 
(Fig. 1C). The lattice assignments were all verified 
by means of electron microscopy with low-angle 
annular dark field (LAADF) imaging after the 
structures were encased in silica (28), whereas 
the 1.4-nm NPs in the 10 + 1.4 nm combination 
did not appear at specific lattice sites (Fig. 1D). 
For the first three combinations, the NP posi- 
tions were determined by the length of the DNA 
bonding elements that define them (Fig. 1E). 
However, for the 10 + 1.4 nm combination, there 
was a marked decrease in the interparticle dis- 
tance compared with the expected value based on 
CCM prediction (74). 

To visualize the positions of the 1.4-nm NPs, 
we performed cryogenic transmission electron 
microscopy (cryo-TEM) on the as-synthesized 
bec lattice formed from the 10 + 1.4 nm NPs and 
then stacked the repeating “unit cell” images and 
EE locations along the [111] zone axis (Fig. 2A). 
Cryo-TEM showed that the large NPs (PAEs) 
assumed a bcc lattice, and the small NPs (EEs) 
were randomized throughout that lattice (Fig. 
2B), which is in agreement with the MD sim- 
ulations (Fig. 2D). In the simulations, crystalline 
structures were obtained for mixtures of comple- 
mentary DNA-functionalized Au NPs at a fixed 
size ratio (10- to 2-nm diameter) but with var- 
iable EE:PAE ratios from 4:1 to 12:1 and num- 
ber of linkers per EE from four to eight (Fig. 2, 
D and E). Because the MD simulations were 
performed in the isobaric-isothermal ensemble 
(NPT) with pressure near zero (supplementary 
materials), these complementary EEs, which are 
delocalized from specific lattice sites (Fig. 2D), 
were responsible for the attraction that holds 
the large PAEs in crystalline positions in these 
metal-like assemblies. 

We determined the degree of delocalization of 
EEs in the MD simulations by discretizing the 
unit cell volumes V..y into (128) voxels of equal 
volume a? so that Vo. = (128a)’ and then count- 
ing the EE visitation frequency in each voxel. This 
frequency gives a probability distribution f, in 
each voxel, #. A quantitative measure of clus- 
tering tendency, Sy, is defined as 


Sa = —)3, fella Fe) + In(Veen/a3)— (1) 
where dp is the average of a over all simulations 


and used as a constant value to normalize the 
volume. The quantity S, can be associated with 
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an information entropy (29). This quantity is 
minimized if the EEs are localized and maxi- 
mized if they are delocalized (when they have a 
uniform distribution). The resulting S. shows a 
strong correlation with both EE:PAE ratio and 
number of linkers per EE (Fig. 2F): Either an 
increase in the number of EEs in the lattice or a 
decrease in the number of duplexed DNA link- 
ers on the EEs resulted in a more randomized 
density distribution of EEs in the lattice. 

The EEs in the PAE-EE assemblies are classical 
particles and as such follow a Boltzmann dis- 
tribution. Thus, the movement of EEs in time is 
related to free-energy barriers that allow for an 
exponential relationship between their trapping 
time and temperature (Fig. 2G), which is directly 
related to the degree of EE delocalization. To vi- 
sualize the spatial distribution of EEs from MD 
simulations, we calculated the cumulative den- 
sity of EEs by integrating f, from its maxima 
(where the density of EEs is the highest) and 
drew isosurfaces that separate the unit cell into 
equally probable accessible volumes for the EEs, 
termed Boltzmann volumes (supplementary 
materials). The Boltzmann volumes for an as- 
sembly with a low number of DNA linkers per 
EE were widely dispersed across the volume 
of the crystal (Fig. 2H). Increasing the num- 
ber of linkers per EE resulted in the localiza- 
tion of EEs into a group of locations near the 
B sites in ABg-type binary compounds (Fig. 21 
and Table 1). This localized state is similar to 
the electron charge-density distributions in 
semiconductors and insulators (30). Further- 
more, the Boltzmann volumes became more dis- 
persed as the EE:PAE ratio increased (Fig. 2J 
and fig. S33). Such a response was also experi- 
mentally observed by comparing the projected 
EE locations determined with cryo-TEM on 
samples of bcc assemblies with varying input 
EE:PAE ratios and EE linker concentrations in 
the solution. The EE local density in the crystal 
with a low EE:PAE ratio and high linker con- 
centration (Fig. 2C) around the predicted lo- 
calization sites was substantially higher than 
the uniform distribution baseline and than the 
local densities in the assemblies with higher 
EE:PAE ratios and lower linker concentrations 
(Fig. 2B and fig. S17). 

Compared with PAEs that reside on relatively 
fixed lattice sites, EEs in a PAE-EE assembly are 
macroscopically mobile beyond local vibrations. 
Time-series MD simulation snapshots showed 
that the EEs could diffuse between unit cells 
(Fig. 3A, fig. S39, and movie S4). To further probe 
the diffusion of EEs in experimentally realized 
assemblies, 10-nm PAEs and Cy5-DNA-labeled 
14-nm EEs were assembled in the bec structure. 
Subsequently, these crystals were incubated with 
a solution of Cy3-DNA-labeled EEs. To track any 
exchange of EEs between the crystalline assem- 
bly and EEs in solution, the ultraviolet-visible 
(UV-vis) extinction spectra of the supernatant 
were measured over time (Fig. 3B, top). A simul- 
taneous increase of Cy5 signal and reduction 
of Cy3 signal suggested that the EEs were highly 
mobile and could diffuse macroscopically be- 
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Fig. 1. Transition from PAE-PAE systems to PAE-EE systems. (A) Illustrations of DNA- 
functionalized Au NPs behaving as programmable atom equivalents (PAEs) or electron 
equivalents (EEs) used in the MD simulation. (B) Snapshots from the MD simulation depicting 
“ionic” bonding behavior shown by PAE + PAE assemblies, and “metallic” bonding behavior 
shown by PAE + EE assemblies, where roaming EEs hold the crystal of repulsive PAEs 
together. (C and D) Four crystalline lattices assembled from 10-nm PAEs and complementary 
DNA-functionalized Au NPs (nominal core diameters of 10, 5, 2, and 1.4 nm, respectively). 
Shown are (C) SAXS spectra, (D) models, and cross-sectional LAADF images of 
silica-encapsulated samples. Scale bar, 25 nm. The 1.4-nm Au NPs in (D) (yellow arrows 
indicate visually identified ones) are dispersed randomly in the lattice and do not occupy 
specific lattice sites. (E) SAXS-determined distance between bonding 10-nm PAE pairs 
(same DNA type, defined in the inset according to CCM assumptions). (F) Quantification of 
linker DNA strands duplexed on 1.4-nm Au NPs (EEs) as a function of the input number of 
linkers per EE in the solution. 


Table 1. Symmetry of PAE-EE assemblies in the fully localized state. Wyckoff positions in 
square brackets have higher energies than the ground-state configurations. 


PAE lattice Space group Localized EE Wyckoff positions 
Dee atid ce tote an ee entre 1S Unter ante a ar aier Oo cornet LOD ebart cer 
fee ohn eee ee tee AUELUENE eels et, meme ne Del nati Bee ee ee 
A15 Pm3n 6c, 16i, 24k, [12], [24] 
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Fig. 2. Spatial probability distribution of A 
EEs in PAE-EE assemblies in the bec 
structure. (A) Workflow for obtaining EE 
location-labeled “unit cell” images from 
cryo-TEM by using image segmentation. 
ACF, auto-correlation function. (B and C) 
Overlay of averaged-intensity TEM images 
and identified EE locations in “unit cells” 
along the [111] direction. The input parame- 
ters refer to the ratio of each substance 
added to the solution, not within a crystal. 
(D and E) MD simulation snapshots of 
PAE-EE assemblies. (F) Measure of 
clustering tendency (S,;) of EEs in MD 
simulations. (G) Temperature-dependent 
trapping time (t) of EEs in MD simulations. 
kg, Boltzmann constant. (H to J) Simulated 
Boltzmann volumes of EEs viewed along 
the [111] direction with EE:PAE = 4:1 and (H) 
4 or (I) 8 linkers per EE, or with (J) EE:PAE = 
9:1 and 8 linkers per EE. Orange dashes 
approximate a repeating “unit cell” used in 
cryo-TEM image analysis. 
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Fig. 3. Diffusion of EEs in PAE-EE assemblies in the bcc structure. 

(A) Trajectory of one EE over time in a lattice of PAEs (yellow) from the MD 
simulation. The color of the EE positions (red to green to blue) represents 
diferent time points. (B) The exchange of dye-labeled particles between 
crystalline lattices and solution was monitored by the change of light extinction 
in solution by means of UV-vis spectroscopy. Cy5-DNA-labeled EEs were 
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Wavelength (nm) 
exchanged from “metallic” PAE-EE assemblies (10 + 1.4 nm, bec) by Cy3-DNA- 
labeled EEs in the supernatant over 24 hours (top), whereas no appreciable 
exchange was observed for PAE-PAE assemblies (10 + 10 nm, bcc) (bottom). 
(C and D) Predicted colloidal crystal metallicity (M,-) in bcc assemblies 

with different number of linkers per EE. The Mc. value has a minimum against 
the EE:PAE ratio at 6:1 (C) but is monotonic against temperature (D). 
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tween the colloidal assemblies and solution. In 
comparison, no appreciable exchange events 
were observed for the bcc assemblies formed 
by 10 + 10 nm PAEs modified with identical 
dye-labeled DNA (Fig. 3B, bottom), suggesting 
a much weaker diffusion of PAEs as compared 
with that of EEs. In both cases, the crystallinity 
of the assemblies was preserved after the ex- 
change (fig. S18). 

The tunable spatial density distribution of 
EEs (based on number, temperature, and linker 
density) and their macroscale diffusion inside 
a crystalline PAE framework establish the con- 
cept of colloidal crystal metallicity, M,,, as 


Mee = Sa - Ween) (2) 


where NN.) is the number of EEs per unit cell 
and is used to make the metallicity indepen- 
dent of crystal unit cell and to reduce to the ideal 
entropy of a gas in the limit of noninteracting 
particles. In Eq. 1, the EEs are considered as a 
group (Fig. 2F), but in Eq. 2, M,. is a quantita- 
tive measure of the average degree of delocal- 
ization per EE, which can decrease when the 
EE:PAE ratio increases (Fig. 3C) even if Sq in- 
creases or remains nearly the same (Fig. 2F). A 
metallicity minimum was attained at EE:PAE = 
6:1, suggesting that the metal-like bonds in the 
colloidal system become saturated. Increasing 
the number of duplexed strands on EEs led to 
crystals with lower M,, values (Fig. 3C) because 
the EEs were more localized or trapped as the 
frequency of DNA binding events increased. 
When the system temperature increased, M,. 
also increased (Fig. 3D) because of the decrease 
in the number of hybridized sticky ends. These 
results show that the EEs can undergo a clas- 
sical (nonquantized) process analogous to a 
metal-insulator transition observed in atomic 
solids (32) because the degree of EE delocal- 
ization and diffusion drastically changed when 
either the temperature, the number of linkers 
per EE, or the EE:PAE ratio was varied. 
Colloidal crystal metallicity depends on the 
number of particles, the number of DNA strands 
that can engage in bonding, and the strength of 
the bonds formed. Because these parameters are 
difficult if not impossible to control in purely 
electrostatic systems owing to electroneutrality 
requirements, metallicity has been neither ob- 
served nor defined in conventional ionic col- 
loidal crystals (crystals formed from oppositely 
charged particles) (4, 9). The complementary 
DNA design on NPs ensures that a crystal will 
not form from PAEs alone (and vice versa from 
EEs alone), which distinguishes this system 
from a description of small particles doped in a 
crystalline solid formed from larger particles. 
Moreover, although this concept of metallicity 
superficially resembles “sublattice melting” in 
superionic conductors (32), in which one sub- 
lattice of an ionic compound loses long-range 
order while the other is fixed (for example, 
Agl), ion diffusion in such systems is mediated 
by Frenkel defects because of the constraint of 
charge balance. Thus, ionic systems have lim- 
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Fig. 4. Equilibrium phases realized by PAE-EE assemblies. (A) Schematic representation 

of the equilibrium conditions of bcc, fcc, and Al5 phases and (B) their corresponding thermal 
melting transitions. (©) SAXS spectra showing the equilibrium phase transition from bcc 

(red, input EE:PAE = 10:1) to a bcc/fec mixture (purple, input EE:PAE = 20:1), and then to a 
majority fcc phase (blue, input EE:PAE = 40:1). (D) Experimental (green) and simulated 
(black) SAXS spectra of Al5 assemblies. (E) Cryo-TEM image of an A15 assembly. (Inset) A15 
lattice model along the [001] direction. (F and G) Simulated Boltzmann volumes of fcc (F) and 
A15 (G) phases as a function of the number of linkers per EE and EE:PAE ratio at a constant 
temperature (kg7 = 1.30). A whole graph is reconstructed by 1/8 of the unit cells from each 
combination. (H and I) Predicted colloidal crystal metallicity (Moc) in (H) fee and (1) A15 
assemblies as a function of the number of linkers per EE and EE:PAE ratio. Two low-metallicity 


configurations (gray shades) are present in (I). 


ited tunability and more strict stoichiometry 
requirements compared with those of the col- 
loidal crystals formed through DNA-directed 
assembly events. 

New phases were accessed by adjusting the 
input EE:PAE ratio in solution and the total 
DNA coverage on EEs. For example, as the input 
EE:PAE ratio was progressively increased, an fcc 
lattice (space group Fm3m) emerged (Fig. 4, A 
and C). The increase in EE:PAE ratio resulted in 
stronger cumulative bonding interactions (there 
are more DNA bonding connections under such 
circumstances), which was reflected by the in- 
crease in the crystal melting temperature, T,,, 
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from 31° (bec) to 41°C (fcc) (Fig. 4B). In addition, 
if both the total DNA coverage on EEs (charac- 
terized by the total number of duplexed and 
nonduplexed strands) and the input EE:PAE 
ratio in solution increase, the Frank-Kasper A15 
phase (space group Pm3n) emerged (Fig. 4, D 
and E). The formation of the A15 phase may be 
associated with its tendency to minimize the 
contact area between repulsive PAEs, which is 
similar to the trend observed in dendrimer as- 
semblies (fig. S27) (33) but different from the 
Cr3Si structure in binary superlattices (14, 34). 
These phases were also observed in the simu- 
lations (table S9). The three phases realized in 
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this system mimic the metal tungsten, in which 
bee, fec, and A15 structures have all been realized 
either in the bulk or in thin films (35) (table S6). 

Critically, the MD simulations allowed us to 
identify the positions that the EEs reside in at 
low M,, values. For example, when the number 
of DNA linkers per EE was maximized and the 
EE:PAE ratio was decreased, the EEs settled into 
distinct locations in the fec and A15 assemblies, 
as shown by the Boltzmann volumes (Fig. 4, F 
and G) and the corresponding M,, values (Fig. 4, 
H and I). The localized lattice for the fcc struc- 
ture resembled a fully filled high-temperature 
Cu,Se lattice (fig. S34, A,Bio), Whereas the A15 
structure shows two possible configurations, 
either clathrate type I (fig. S35A, AgByg) (36) or an 
unreported lattice (fig. S35B, AgBgy). The latter 
configuration, in which 1. reached a local min- 
imum, contained all sites in the clathrate struc- 
ture that were fully occupied, and the additional 
EEs that could not occupy the lowest energy 
positions began to fill the higher-energy 12f and 
24j positions (Table 1). 

Taken together, this work makes the case for 
describing certain classes of colloidal crystals 
in a fundamentally new way, in which, in the 
case of mobile particles (EEs), the concept of 
metallicity becomes important. By understand- 
ing the factors that govern EE diffusion and 
delocalization, we have a better understanding 
of the structures and phases that can be ac- 
cessed through colloidal crystals, potentially in- 
cluding metals, intermetallics, and complex metal 
alloys. It also challenges the colloidal science com- 
munity to identify exotic new properties that arise 
from the PAE-to-EE transition and structures that 
exhibit high degrees of metallicity as well as to 
develop theoretical models that capture the ef- 
fects that lead to metallicity. 
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Akkermansia muciniphila induces 
intestinal adaptive immune responses 
during homeostasis 


Eduard Ansaldo’, Leianna C. Slayden’, Krystal L. Ching’*, Meghan A. Koch"t, 
Natalie K. Wolf’, Damian R. Plichta”, Eric M. Brown’, Daniel B. Graham”***, 
Ramnik J. Xavier”?**, James J. Moon®, Gregory M. Barton’ 


Intestinal adaptive immune responses influence host health, yet only a few intestinal 
bacteria species that induce cognate adaptive immune responses during homeostasis 
have been identified. Here, we show that Akkermansia muciniphila, an intestinal 
bacterium associated with systemic effects on host metabolism and PD-1 checkpoint 
immunotherapy, induces immunoglobulin G1 (lgG1) antibodies and antigen-specific 

T cell responses in mice. Unlike previously characterized mucosal responses, T cell 
responses to A. muciniphila are limited to T follicular helper cells in a gnotobiotic 
setting, without appreciable induction of other T helper fates or migration to the lamina 
propria. However, A. muciniphila—specific responses are context dependent and adopt 
other fates in conventional mice. These findings suggest that, during homeostasis, 
contextual signals influence T cell responses to the microbiota and modulate host 
immune function. 


dependent (TD) IgA have been identified (2-4). 
Until recently, intestinal IgG antibody responses 
were only thought to occur in the context of 
mucosal barrier disruption (5) or in response to 


daptive immune cells are critical contrib- 
utors to tissue homeostasis in the intestine, 
with B cell-derived immunoglobulin A (IgA) 
playing a major role in establishing barrier 


function (J). T cell-independent (TI) IgA 
recognizes a broad fraction of intestinal microbes. 
By contrast, only a few species that induce T cell- 


enteric pathogens (6) and certain pathobionts 
that breach the intestinal barrier (7). However, 
recent work has revealed that mice also generate 


broadly reactive TI anticommensal IgG2b and 
IgG3 in a largely Toll-like receptor 2 (TLR2)- and 
TLR4-dependent manner (8). 

Despite the abundance of foreign commen- 
sal antigens and the high frequency of effector 
and memory T cells within the intestine, only 
a small number of intestinal bacteria species 
that induce antigen-specific T cell responses 
during homeostasis have been identified (J, 9, 10). 
Inappropriate T cell responses against the mi- 
crobiota are believed to contribute to the patho- 
genesis of inflammatory bowel disease (J1, 12). 
However, anticommensal effector T cell responses 
have also been hypothesized to provide bystander 
protection in the context of enteric infections 
(13). Thus, understanding which commensal 
species induce cognate T cell responses, the sig- 
nals that mediate their induction and regula- 
tion, and their effects on host physiology remain 
important goals. Here, we describe an anticom- 
mensal TD IgGI1 response and use it to identify 
and characterize a commensal species that in- 
duces T cell responses during homeostasis. 

We examined serum antibody binding to in- 
tact commensal bacteria by staining fecal sam- 
ples with paired mouse sera and isotype-specific 
secondary antibodies as previously described (8). 
Surprisingly, comparing wild-type (WT) and T 
cell-deficient (Ti cro! ~) mice revealed that mice 
mount a microbiota-reactive IgG1 antibody re- 
sponse that is entirely dependent on of T cells 
(Fig. 1, A and B, and fig. S1, A to C), whereas 
anticommensal IgG2b, IgG3, and IgA are in- 
duced in a TI manner, as previously reported 
(fig. S1A) (4, 8, 14). The fraction of commensals 
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Fig. 1. Mice generate anticommensal |gG1 antibodies during 
homeostasis. (A) Representative IgG1 flow cytometric analysis of fecal 
microbiota with sera from WT and T cell-deficient (Tcrb~’-) mice. 
Feces and sera originated from the same mouse (paired serum) 
except when using antibody-deficient (Ighm~’-) serum as a negative 
staining control. SYBR Green labels a fraction of the microbiota, 
ensuring that SYBR™ events are bacteria, whereas some of the 
SYBR'° events are also commensals that are less permeable to the 
dye (8). (B) IgGl microbiota flow cytometric analysis compiled from 
eight independent experiments. All mice were housed at UC Berkeley. 
WT, n = 63; Terb-’~, n = 35. (C) IgG1 microbiota flow cytometric 
analysis with paired feces and sera from mice of the indicated genetic 
backgrounds and vivaria. Balb/c mice were from The Jackson Labora- 
tory. Jax B6 and Tac B6 C57BL/6 mice were from the Jackson 
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Laboratory and Taconic Biosciences, respectively. Swiss Webster 
(SW) mice were from Taconic Biosciences (n = 5 mice per group). 
Data are representative of two independent experiments. (D) Results 
from sorting and 16S rDNA sequencing of lgGl-bound and -unbound 
fractions (n = 12 mice). Graph depicts the average log2 ratio of 
abundances between both fractions for each individual OTU and the 
corresponding gq value. Data are representative of two independent 
experiments. Each symbol represents a mouse [(B) and (C)] or 

an OTU (D). Error bars represent mean + SD. Gates on flow cytometry 
plots show mean + SEM. P values were calculated by Kruskal-Wallis 
test followed by Dunn's multiple-comparisons test (B) or by paired-ratio 
Student's t test followed by the Benjamini, Krieger, and Yekutieli 
two-stage false discovery rate (FDR) correction for multiple compar- 
isons, with an FDR of 0.01 (D). 
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Fig. 2. A. muciniphila is necessary and sufficient to induce cognate 
A. muciniphila-specific |gG1 antibody responses. (A) Representative 
IgG1 bacterial flow cytometric analysis of A. muciniphila incubated with the 
indicated mouse sera. Geometric mean fluorescence intensity (gMFl) of 
the assay was quantified in (C) for multiple mice. (B) Quantification 

of A. muciniphila colonization by fecal 16S quantitative polymerase chain 
reaction (qPCR) for mice described in (C). LoD, limit of detection. 

(C) A. muciniphila |gG1 bacterial flow cytometric analysis for mice of the 
indicated genotypes and indicated A. muciniphila colonization status. WT 
Akk~, n = 25; WT Akk*, n = 41; Terb-’~, n = 15. Data are compiled from 
seven independent experiments. (D) Quantification of A. muciniphila 
colonization by fecal 16S qPCR before (WT Akk ) and 5 weeks after (WT 
Akk-colonized) a single A. muciniphila oral gavage of 10° colony-forming 
units. n = 6 mice. Data are representative of three independent 


experiments. (E and F) Representative plot (E) and quantification (F) of 
A. muciniphila |gG1 bacterial flow cytometric analysis using sera from mice 
before (Akk ) and 5 weeks after (Akk-colonized) colonization. n = 5 mice. 
Data are representative of three independent experiments. (G) Quantifi- 
cation of A. muciniphila colonization by fecal 16S qPCR. ASF, n = 6 mice; 
ASF+Akk, n = 16 mice. Data are representative of two independent 
experiments. (H and I) A. muciniphila bacterial flow cytometric analysis 
with serial dilution of serum in ASF and ASF+Akk mice. Each line 
represents one mouse. The x-axis denotes total serum IgG1 (H) or serum 
IgA (1) concentration in the assay. n = 9 mice per group. Data are 
representative of two independent experiments. Each symbol represents a 
mouse; error bars represent mean + SD. P values were calculated with 

a Kruskal-Wallis test followed by Dunn's multiple-comparisons test (B and 
C) or Mann-Whitney test (D, F, and G). 


bound by TD IgG1 was ~10% (Fig. 1B), far less 
than the percentage recognized by IgA, IgG2b, 
and IgG3 (fig. S1A). Thus, mice produced IgG1 
that appeared to recognize only a subset of mi- 
crobes in the intestine, in contrast to the poly- 
reactive TI antibodies that bind diverse bacteria 
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(8, 15). T cell-sufficient (Terb*/-) and Terb~ co- 
housed littermates were compared with control 
for the composition of the microbiota, and an 
anticommensal IgG1 response was found only 
in Terb*’” mice (fig. S1B). Moreover, analysis of 
mice from multiple vendors and of different 
genetic backgrounds demonstrated that this 
IgGl response was a general feature of healthy 
mice (Fig. 1C). 

To identify the commensal species targeted by 
this humoral response, we sorted IgGl-bound 
and -unbound populations from fecal samples 
stained with sera from corresponding mice (fig. 
$2, A and B) and performed 16S ribosomal DNA 
(rDNA) sequencing on the resulting fractions. 
Analysis of operational taxonomic units (OTUs) 
that were significantly enriched in the IgG1-positive 
fractions compared with the IgG1-negative frac- 
tions revealed two major IgG1 targets (Fig. 1D 
and fig. S2, C and D). These OTUs correspond to the 
Akkermansia genus (OTU2) and the Bacteroides 
$24-7 family (OTU63). Bacteroides S24-7 com- 


2019 


prises a poorly characterized family of mouse 
intestinal microbes (16). Akkermansia is a genus 
of commensals in the Verrucomicrobia phylum, 
which until recently, only contained one member, 
Akkermansia muciniphila (17). A. muciniphila can 
degrade mucin (7) and is an abundant member of 
the human intestinal microbiota (78). Coloniza- 
tion with A. muciniphila has been reported to 
have protective effects in diet-induced obesity 
(19, 20), to promote mucosal wound healing (2D, 
and to increase antitumor responses during anti- 
PD-1 immunotherapy (22). The mechanisms by 
which A. muciniphila mediates these diverse ef- 
fects remain poorly understood, as little is known 
about host sensing of this bacterium. Therefore, 
based on the ability of A. muciniphila to induce 
TD IgGl responses and its reported effects on 
host physiology, we sought to characterize the 
immune response to this bacterium. 

We isolated A. muciniphila from mice in our 
colony by plating feces on selective media (17). 
We then used bacterial flow cytometric analysis 
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Fig. 3. A. muciniphila induces antigen-specific Tf, cell responses 
during homeostasis. (A) Representative flow cytometric analysis 
depicting transferred T cells (Thy1.1*) as a percentage of all CD4* T cells 
in the PPs of ASF and ASF+Akk mice 12 days after low-frequency 
adoptive transfer of Amuc124 (TCR-transgenic) T cells. (B) Frequencies 
of transferred T cells in intestinal tissues of ASF and ASF+Akk mice. 

n= 5 mice per group; data are representative of six independent 
experiments. (C and D) Representative flow cytometric analysis of 
expression of Try markers (PD-1, Bcl6, and CXCR5) by endogenous and 
transferred T cells in the PPs of ASF+Akk mice. (E and F) Expression 

of Ty1 (T-bet* FOXP37), Ty2 (GATA3* FOXP37), Ty17 (RORyt* FOXP3°), 
Treg (FOXP3*), or Try (Bcl6* PD-1*) markers by transferred T cells in the 
PPs (E) and SILP (F) of ASF+Akk mice. n = 9 mice. Data are repre- 
sentative of six independent experiments. (G) Representative Am3735- 
1 tetramer flow cytometric analysis of PPs from ASF and ASF+Akk 
mice. (H) Frequencies of Am3735-1- or Am3740-1 tetramer—positive 


Cell number (x10?) 


Ansaldo et al., Science 364, 1179-1184 (2019) 21 June 2019 


SILP LILP mLN PP 


p=0.79 p=0.86 p=0.61 p<i0* @ASF+Akk 


SILP LILP mLN PP 


endogenous T cells in ASF and ASF+Akk mice as a percentage of total 
CD4* T cells. ASF, n = 4 mice; ASF+Akk, n = 5 to 7 mice. Data are 
representative of three (Am3740-1) or six (Am3735-1) independent 
experiments. (1) Frequencies of Am3735-1- or Am3740-1 tetramer—positive 
cells expressing Try markers [PD-1 and CXCR5, as shown in (J)] in the PPs 
of ASF+Akk mice. n = 5 to 7 mice; data are representative of three 
(Am3740-1) or six (Am3735-1) independent experiments. (J) Representa- 
tive flow cytometric analysis of expression of Try markers (PD-1 and 
CXCR5) by endogenous Am3735-1- or Am3740-1 tetramer—positive cells. 
(K) Numbers of Am3735-1- and Am3740-1 tetramer—positive cells in 

all intestinal tissues in ASF and ASF+Akk mice. ASF, n = 4 mice; ASF 
+Akk, n = 5 mice. Data are representative of two (Am3740-1) or three 
(Am3740-1) independent experiments. Each symbol represents a mouse. 
Error bars represent mean + SD. Gates on flow cytometry plots show 
mean + SEM. P values were calculated with unpaired Student's t test (B and 
K) or Mann-Whitney test (H). 
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to confirm the presence of A. muciniphila- 
specific IgG1 antibodies in the sera of mice that 
harbored A. muciniphila at steady state (Fig. 2A). 
These IgG1 antibody responses to A. muciniphila 
could consist of preexisting natural polyreactive 
specificities or could be antigen specific. To dis- 
criminate between these possibilities, we identi- 
fied C57BL/6 mice lacking A. muciniphila in 
their microbiota (Fig. 2B). Comparing IgG1 
antibody responses between A. muciniphila- 
negative and A. muciniphila-positive mice con- 
firmed that the induction of A. muciniphila-specific 
serum IgGl responses required colonization 
with A. muciniphila as well as T cells (Fig. 2, 
A to C, and fig. S3A)..A. muciniphila-positive mice 
also mounted serum.A. muciniphila-specific TD 
IgA responses (fig. S3B). Moreover, de novo col- 
onization of A. muciniphila-negative mice by oral 
gavage was sufficient to induce A. muciniphila- 
specific IgG1 antibodies (Fig. 2, D to F, and fig. 
S3C). Thus, IgG1 responses to A. muciniphila are 
not derived from preexisting cross-reactive spec- 
ificities. Rather, mice mount an antigen-specific 
TD IgG1 antibody response upon A. muciniphila 
colonization. 

We noted that titers of serum IgG1 responses 
against A. muciniphila were variable across 
A. muciniphila-positive mice. A small number 
of mice lacked A. muciniphila-specific IgG] al- 
together despite similar colonization (Fig. 2, B 
and C, and fig. S3A). One explanation for this 
variability is that variation within intestinal 
microbial communities may alter the response 
to A. muciniphila. Indeed, previous studies have 
shown that intestinal infection or inflammation 
can lead to altered bystander responses against 
commensal microbes (23). To overcome such 
complications, we established a defined gnotobiotic 
system to determine whether direct engagement 
of the mucosal immune system by A. muciniphila 
underlies the TD IgG1 response. To this end, we 
introduced A. muciniphila into gnotobiotic C57BL/ 
6 mice colonized with altered Schaedler flora 
(ASF) (24, 25) to generate two mouse colonies 
with identical microbiota except for the pres- 
ence of A. muciniphila in the ASF+Akk colony. 
A. muciniphila colonized ASF+Akk mice to high 
levels and was vertically transmitted (Fig. 2G and 
fig. S3D). We restricted all of our analyses to 
progeny (or progeny of progeny) of ASF+Akk mice 
that acquired A. muciniphila via vertical trans- 
mission. Mice colonized with the ASF+Akk flora, 
but not the ASF flora alone, mounted IgG1 and 
IgA antibody responses specific for A. muciniphila, 
with very consistent titers between mice (Fig. 2, H 
and I). Thus, A. muciniphila directly engages the 
immune system to induce TD IgG1 and IgA. 

To explore T cell responses to A. muciniphila, 
we expanded A. muciniphila-specific T cell lines 
from intestinal tissues of A. muciniphila-colonized 
mice and generated T cell hybridomas by fusion 
with BWZ.36 cells (26) (fig. S4, A and B). We iden- 
tified the antigens recognized by two of these 
hybridomas (124-2 and 168-H10) by screening 
an A. muciniphila genomic expression library 
for clones that stimulated each T cell hybridoma 
(fig. S4, C to E). The 124-2 hybridoma recognized 
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a peptide derived from Amuc_RS03735, an outer 
membrane autotransporter domain-containing 
protein. Next, we generated A. muciniphila- 
specific, T cell receptor (TCR)-transgenic mice 
(Amuci124) using the TCRa and TCR§ chains 
from the 124-2 T cell hybridoma (figs. S4A and 
S5, A and B). Importantly, we maintained Amucl24 
mice free of A. muciniphila colonization for all 
subsequent experiments. 

We sought to track the T cell response to 
A. muciniphila by performing low-frequency 
adoptive transfers of naive, congenically marked 
(Thyl.1) Amuci24 CD4* T cells into ASF and 


A SPF Akk- C57BL/6 
PP, all CD4* T cells 


SPF Akkt C57BL/6 B 
PP, all CD4* T cells 1.0 


ASF+Akk mice (fig. S6A)..A. muciniphila-specific 
T cells expanded in ASF+Akk mice but were 
undetectable in ASF mice, indicating that A. 
muciniphila antigens are presented under ho- 
meostatic conditions (Fig. 3, A and B; fig. S6, A 
and B; and fig. S3A). Amucl24 T cells were localized 
to the Peyer’s patches (PPs) and to the mesenteric 
lymph nodes (mLNs), but very few cells were 
found in the large intestine or small intestine 
lamina propria (LILP or SILP, respectively) (Fig. 3, 
A and B, and fig. S6B). The majority of trans- 
ferred T cells expressed the T follicular helper 
(Ty) cell markers PD-1, Bcl6, and CXCR5 (Fig. 3, 
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Fig. 4. A. muciniphila—specific T cells also adopt other fates in the context of a complex 
microbiota. (A) Representative flow cytometric analysis depicting transferred Amuc124 T cells 
(Thy1.1*) as a percentage of all CD4* T cells in the PPs of SPF Akk” and SPF Akk* mice. (B) Fre- 
quencies of transferred T cells as a percentage of all CD4* T cells in intestinal tissues of conventional 
SPF A. muciniphila—negative (n = 4) and SPF A. muciniphila—positive (n = 5) mice. Data are 
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representative of three independent experiments. (C and D) Expression of Ty1 (T-bet* FOXP37), Ty2 
(GATA3* FOXP3°), Ty17 (RORyt* FOXP3°), Treg (FOXP3"), or Try (Bcl6* PD-1*) markers by 
transferred T cells in the PPs (C) or SILP (D) of SPF A. muciniphila—positive mice. n = 5 mice. Data 
are representative of three independent experiments. (E) Representative flow cytometric analysis of 
expression of Ty] and Ty17 markers by endogenous total CD4* T cells and A. muciniphila—specific 
(transferred) T cells in the SILP of SPF A. muciniphila—positive mice. Each symbol represents a 
mouse. Error bars represent mean + SD. Gates on flow cytometry plots show mean + SEM. P values 
were calculated with unpaired Student's t test (B). 
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C to E, and fig. S6, C and D), with a small per- 
centage adopting the regulatory T cell (Treg) 
marker FOXP3. The small number of T cells 
detectable in the LP were also skewed toward a 
Tru cell phenotype, most likely indicating that 
secondary or tertiary lymphoid tissues were not 
completely excluded from these preparations 
(Fig. 3F). Transferred Ragi*/* or a Ragt’~ 
Amucl24 T cells yielded similar results (fig. $7, A 
to D). Surprisingly, transferred T cells express- 
ing markers for T helper 1 (Ty1), Ty2, or Ty17 
cells (T-bet, GATA3, or RORyt, respectively) were 
not detected in appreciable numbers (fig. S6D). 
To track endogenous A. muciniphila-specific 
T cells, we generated I-A” tetramers loaded with 
the peptide TLYIGSGAILS (Thr-Leu-Tyr-Ile-Gly- 
Ser-Gly-Ala-Ile-Leu-Ser) from the outer mem- 
brane protein Amuc_RS03735 (Am3735-1) (fig. 
S4, C to F). Endogenous, tetramer-positive, 
A. muciniphila-specific T cells were identified in 
the PPs of ASF+Akk but not ASF mice (Fig. 3, G 
and H), consistent with the results obtained with 
Amucl24 T cells after adoptive transfer. More- 
over, the endogenous A. muciniphila-specific 
T cells identified by tetramer staining also ex- 
pressed Ty; cell markers (Fig. 3, I and J). An in- 
dependent tetramer with a different Akkermansia 
epitope from Amuc_RS03740 (Am3740-1) yielded 
very similar results (Fig. 3, H to J). Finally, we 
probed all gut-associated lymphoid tissue (LILP, 
PPs, and SILP) and mLNs with both tetramers. 
A. muciniphila-specific T cells were detected in 
the PPs but not in the LP (Fig. 3K), which con- 
firmed results obtained with Amuc124 T cell 
transfers. Thus, A. muciniphila induces antigen- 
specific T cell responses in the intestine that 
manifest primarily as Ty cells in the PPs. 
Finally, we returned to specific pathogen-free 
(SPF) mice, in which we had observed greater 
variability in the A. muciniphila IgG1 response, 
and characterized the T cell response to A. 
muciniphila in the context of a conventional 
microbiota. Similar to our findings for the ASF 
system, transferred Amuc124 T cells expanded and 
localized to the PPs in A. muciniphila-positive 
mice but were undetectable in A. muciniphila- 
negative mice (Fig. 4, A and B, and fig. S8A). The 
majority of transferred T cells in the PPs also 
adopted Ty; cell markers (Fig. 4C and fig. S8B). 
Therefore, A. muciniphila also induces a Ty; cell 
response in the context of a complex microbiota. 
However, unlike the ASF+Akk system, greater 
numbers of transferred T cells were detected in 
the intestinal LP of A. muciniphila-positive mice 
(Fig. 4B), some of which adopted markers con- 
sistent with proinflammatory T cell fates (Fig. 4, 
D and E, and fig. S8, C to E). Consistent with 
the variable A. muciniphila-specific IgG1 titers 
observed in SPF mice (Fig. 2C), some cohorts of 
SPF Akk* mice lacked detectable T cell activation 
and proliferation after transfer despite the pres- 
ence of A. muciniphila. Thus, T cell responses to 
A. muciniphila appear to be context dependent, 
resulting in the induction of other CD4 T effector 
fates in addition to Ty; cells in certain condi- 
tions. Interestingly, and in contrast to what has 
been described for segmented filamentous bacteria 
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(SFB) and Helicobacter spp. (9, 10, 27), T cell re- 
sponses to A. muciniphila in SPF mice were mixed 
between different CD4 T cell fates (fig. SSE). 

Commensal-specific TD antibodies were pre- 
viously thought to be restricted to IgA responses 
specific to a small subset of commensal species 
(2). Our work reveals that such TD antibodies 
also include IgG1, which recognizes a small sub- 
set of commensals, including A. muciniphila. 
These bacteria may share certain features, such 
as proximity to the intestinal epithelium, which 
may increase their potential to cause disease 
during barrier disruption. Indeed, systemic anti- 
bodies specific for commensal bacteria have been 
reported to protect against gut-derived septicemia 
(7, 28), and. A. muciniphila can promote disease in 
certain immunodeficient settings (29). 

Very few commensal antigens have been iden- 
tified to date that lead to antigen-specific T cell 
responses in the gut during homeostasis. SFB 
and Helicobacter spp. both elicit very defined 
responses. SFB induce RORyt* Ty17 cells both 
in conventional and monocolonized mice (27), 
whereas Helicobacter spp. induce RORyt* FOXP3* 
iTyeg cells in the LILP (9, 10). Try cells made up a 
small proportion of SFB- and Helicobacter spp.- 
specific T cells, but these were in the context of 
Tyl7 cell- or T,,..-dominated responses, respec- 
tively (0). Try cells in the intestine have been 
suggested to differentiate from either Tj;17 cells 
or Tyegs (30-32). By contrast, the ASF+AkkK sys- 
tem produced commensal-specific T cell responses 
dominated by the induction of Ty} cells, with very 
few Treg, Ty1, Ty2, or Ty17 cells. Thus, commensal- 
specific Ty responses can occur in the absence 
of a primary CD4 T cell response of a different 
fate and may differentiate from naive commensal- 
specific T cells in the mesenteric lymph nodes 
or PPs. Consequently, A. muciniphila appears 
to engage the mucosal immune system in a 
manner distinct from previously described 
T cell-activating commensal bacteria. This 
commensal-specific Tpy response appears in 
conjunction with robust anticommensal TD 
IgGl and IgA. 

Surprisingly, in the context of a conventional 
microbiota, and in addition to the induction of 
Tru Cells, A. muciniphila can induce CD4 T cells 
of other fates that home to the LP. Together, 
these results support the hypothesis that T cell 
responses against commensals can be context de- 
pendent, not just in the setting of acute gastro- 
intestinal infection or inflammation (23), but 
also during homeostasis. Interactions with certain 
microbes may change the localization or function 
of A. muciniphila, or signals provided by other 
microbes may shape the immune response against 
this commensal bacterium. 

Prior work has established that A. muciniphila 
mediates effects on host metabolism (19, 20) and 
can influence the efficacy of anti-PD-1-based 
immunotherapy against cancer (22). The mech- 
anisms for these effects remain poorly under- 
stood, but both appear to be immune mediated 
and correlated with type 1 immunity. In partic- 
ular, responses to anti-PD-1-based immunother- 
apy in humans were correlated with interferon 
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gamma production by peripheral T cells incu- 
bated with A. muciniphila antigens in vitro (22). 
Interestingly, not all patients generated type 
lresponses against A. muciniphila. Our results 
provide a potential explanation for this varied 
response and suggest that differential skewing 
of A. muciniphila-specific T cell responses in 
individuals due to differences in microbiota com- 
position or other environmental signals may 
have profound systemic effects. Defining the 
mechanisms by which these commensal-specific 
T cell responses can be skewed toward different 
fates is an important goal with clear clinical 
implications. 
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Structural identification of a hotspot 
on CFTR for potentiation 


Fangyu Liu®?*, Zhe Zhang’*, Anat Levit®, Jesper Levring’, Kouki K. Touhara‘*+, 
Brian K. Shoichet®, Jue Chen’? 


Cystic fibrosis is a fatal disease caused by mutations in the cystic fibrosis transmembrane 
conductance regulator (CFTR). Two main categories of drugs are being developed: correctors 
that improve folding of CFTR and potentiators that recover the function of CFTR. Here, we report 
two cryo-electron microscopy structures of human CFTR in complex with potentiators: 

one with the U.S. Food and Drug Administration (FDA)—approved drug ivacaftor at 3.3-angstrom 


resolution and the other with an investigational drug, GLPG1837, at 3.2-angstrom resolution. 
These two drugs, although chemically dissimilar, bind to the same site within the transmembrane 
region. Mutagenesis suggests that in both cases, hydrogen bonds provided by the protein are 
important for drug recognition. The molecular details of how ivacaftor and GLPG1837 interact with 
CFTR may facilitate structure-based optimization of therapeutic compounds. 


he cystic fibrosis transmembrane conduct- 
ance regulator (CFTR) is an anion channel 
widely expressed on epithelial surfaces of 
different organs, including the lung and 
intestine (2). It belongs to the family of the 
ATP-binding cassette (ABC) transporters, but 
functions as an anion channel. CFTR consists of 
two transmembrane domains (TMDs) that form 
the pore, two cytoplasmic nucleotide-binding 
domains (NBDs) that bind and hydrolyze aden- 
osine 5°-triphosphate (ATP), and a regulatory 
(R) domain that must be phosphorylated to al- 
low the channel to open (2). More than 300 muta- 
tions have been identified to cause cystic fibrosis 


Fig. 1. lvacaftor binds CFTR inside the membrane. (A) Overall 
structure of the phosphorylated, ATP-bound human CFTR in complex 
with ivacaftor (shown in magenta). TMD1 and NBD1 are shown in blue, 
TMD2 and NBD2 in green, and R domain in red. TM8 is highlighted in 
cylinder representation. Regions not resolved in the structure are shown 
as dashed lines. (B) Superposition of phosphorylated, ATP-bound CFTR 
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(CF); details on the variants are given at the 
CFTR2 website (3). The most prevalent muta- 
tion is the deletion of a single amino acid, F508, 
which makes CFTR prone to degradation before 
reaching the cell’s plasma membrane (4). Other 
mutants, such as R117H and G551D, are ex- 
pressed on the cell membrane but do not gate 
properly (5). 

Over the past eight decades, medical advances 
have improved the treatment of cystic fibrosis. 
The average survival age of patients has been 
lengthened from early infancy in the 1930s to 
around 47 years at present. Most treatments 
offer symptomatic relief, including pancreatic 


OM + ATP + Ivacaftor 
1 +ATP only 


enzyme supplements to aid digestion, antibiotics 
to prevent and treat infection, mucus-thinning 
drugs to clear the airway, and lung transplants. 
Recently, therapies have been developed to tar- 
get the CFTR protein. Small-molecule CFTR 
modulators include correctors that increase 
the abundance of CFTR at the cell surface and 
potentiators that increase the ion flux of mu- 
tant CFTR (6-8). Currently, two correctors 
(umacaftor and tezacaftor) and one potenti- 
ator (ivacaftor), all developed by Vertex Pharma- 
ceuticals, are available to patients (9-17). In 
addition, many other candidates to enhance 
the function of CFTR are in the drug discovery 
pipeline (7, 12, 13). All of these CFTR modu- 
lators were discovered through intensive high- 
throughput screening and iterative medicinal 
chemistry optimization. Rational drug discov- 
ery has not been feasible, owing to the lack of 
structural information. To address this issue, 
we report here cryo-electron microscopy (EM) 
structures of the human CFTR in complex with 
two different potentiators: the Vertex drug 
ivacaftor (6) and GLPG1837, an investigational 
drug developed by Galapagos (7). 
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in the absence (yellow) and presence of ivacaftor (magenta). (C) A 
magnified view of the ivacaftor-binding site. CFTR is shown as a 
transparent surface model with TM 4, 5, and 8 indicated. Ivacaftor is 
shown as a stick model together with the corresponding EM density. 
(D) lvacaftor binds at the protein-lipid interface, exposing half of its 
surface to the lipid bilayer. 
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Fig. 2. Contribution of individual residues to ivacaftor binding. 

(A) Schematic drawing of interactions formed between ivacaftor (magenta) 
and the CFTR-binding site. Residues within van der Waals distances 

(<4.5 A) are shown. Representations: black dashed lines, hydrogen bonds; 
blue vertical lines, aromatic interactions; spokes, hydrophobic interactions. 
The hinge region in TM8 is shown as black sticks and labeled. (B) Stereo 
view of the ivacaftor-binding site. Residues within van der Waals distances 
are shown in yellow, and hydrogen bonds are depicted as black dashed 


Ivacaftor was discovered by screening com- 
pounds that increase anion flux in G551D- 
CFTR-expressing cells (6). Subsequent studies 
have shown that ivacaftor increases the open 
probability (P,) of both wild-type (WT) and mu- 
tant CFTRs in membrane patches, proteolipo- 
somes, and planar lipid bilayers (14-16). The 
potentiation by ivacaftor requires phospho- 
rylation of CFTR by protein kinase A (PKA), 
but is independent of ATP (15). These results 
suggest that ivacaftor acts directly on CFTR, 
rather than functioning through other regu- 
latory mechanisms. 

To describe the specific molecular interactions 
between ivacaftor and CFTR, we determined a 
cryo-EM structure of ivacaftor in complex with 
phosphorylated E1371Q CFTR in the presence of 
saturating ATP-Me”* (10 mM) (Fig. 1A, figs. $1 to 
$3, and table S1). The final map has an overall 
resolution of 3.3 A, showing well-defined density 
throughout the protein, except for the R domain. 
Density for both ATP-Mg”* molecules are visible 
at the NBD dimer interface (fig. S3). An addi- 
tional strong density is observed on the outer 
surface of the TMDs in the center of the lipid 
bilayer (Fig. 1C and fig SIC). This density, not 
observed in any of the previous CFTR structures 
(17-20), has a shape and size consistent with 
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the chemical structure of ivacaftor (Fig. 1C and 
fig S3). Within this density, we built a model of 
ivacaftor and examined multiple orientations of 
it in the site, using molecular docking (27, 22) 
followed by energy minimization of the protein- 
ligand complex. Complexes were prioritized by 
their energetic complementarity, ability to make 
favorable polar interactions, and subsequent re- 
finement to the electron density maps. 

We previously reported the structure of the 
phosphorylated E1371Q construct in the pres- 
ence of ATP-Mg”* but in the absence of iva- 
caftor (20). The ivacaftor-bound E1371Q exhibits 
the same protein conformation in which the ATP- 
bound NBDs form a closed dimer, and the two 
TMDs pack closely together to form an ion con- 
duction pathway open to the cytoplasmic solu- 
tion. The R domain, largely unstructured, is 
located along the peripheral surface of NBD1 and 
the cytoplasmic region of the TMDs (Fig. 1A). 
No significant protein conformational changes 
were observed upon binding of ivacaftor, and 
the overall root mean square deviation between 
the two structures is 0.14 A (Fig. 1B). 

Ivacaftor binds CFTR at the protein-lipid inter- 
face, docking into a cleft formed by transmem- 
brane (TM) helices 4, 5, and 8 (Fig. 1, C and D). 
The binding site coincides with a hinge region in 
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lines. An unknown density between R933 and ivacaftor is shown as 

green mesh. (C to E) Binding affinities of WT CFTR and mutants replacing 
residues making (C) hydrogen bonds, (D) aromatic interactions, and 

(E) hydrophobic interactions with ivacaftor. Data points represent 

the means and SEMs of at least three measurements. The calculated 

Kg values are listed in Table S2. Single-letter abbreviations for the amino 
acid residues are as follows: A, Ala; D, Asp; E, Glu; F, Phe; G, Gly; L, Leu; 
M, Met; Q, Gin R, Arg; S, Ser; and Y, Tyr. 


TMS, a structural feature of CFTR not found in 
other ABC transporters (19, 20). The extracellular 
segment of TM 8 rotates around this hinge upon 
ATP binding (17-19); stabilizing this rotation may 
explain the drug’s efficacy. Whereas about 40% 
of the molecular surface of ivacaftor is buried 
against CFTR, the remaining 60% is exposed to 
the hydrophobic region of the membrane (Fig. 1, 
Cand D). 

The interactions between ivacaftor and CFTR 
include two hydrogen bonds, two aromatic inter- 
actions, and six hydrophobic interactions (Fig. 2, 
A and B). To evaluate how each residue con- 
tributes to ivacaftor binding, we developed a 
scintillation proximity assay (SPA) to measure 
the apparent affinity of ivacaftor for CFTR (Fig. 
2, C to E). Every residue in the binding site was 
individually substituted by alanine. Every mu- 
tant eluted from the size-exclusion column as a 
monomeric peak, similar to the WT CFTR, in- 
dicating that these mutations did not alter CFTR 
folding (fig. $4). Specific binding of ivacaftor to 
the WT CFTR increased as a function of ivacaftor 
concentration (Fig. 2C). Nonlinear regression 
analysis shows that the data fit well to a single- 
site binding model with an equilibrium dissoci- 
ation constant (Kq) of 6.6 + 1.2 nM. In comparison, 
all but one mutation resulted in a reduced binding 
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Fig. 3. GLPG1837 binds to the same site as ivacaftor. (A) Represen- 
tative recordings of WT (upper trace) and E1371Q (lower trace) CFTR 
reconstituted in synthetic lipid bilayers. CFTR was phosphorylated 
with PKA prior to fusion with bilayers. Recordings were performed on 
individual membranes with 2 mM ATP before (left) and after (right) 
addition of 10 uM GLPG1837. (B) Open probabilities of WT and E1371Q 
CFTR before (open bar) and after (filled bar) addition of 10 uM 
GLPG1837. Data points represent the means and SEMs of three to nine 
membranes. P, = 0.23 + 0.02, for WT; P, = 0.54 + 0.08, for WT + 
GLPG1837; P, = 0.64 + 0.03, for E1371Q; P, = 0.88 + 0.03, for E1371Q + 
GLPG1837. ***p < 0.001 by Student's t test. (©) The presence of 
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1.5 uM GLPG1837 shifts the apparent Kg of ivacaftor from 6.6 + 1.2 nM 
(dashed line) to 54 + 4 nM (solid line, n = 9). (D) Competition 

binding assay. lvacaftor was kept at a constant concentration of 8 nM, 
and its binding to WT CFTR is plotted as a function of GLPG1837 
concentration. Kj = 0.30 + 0.08 uM. (E) Ribbon diagram of the 
phosphorylated, ATP-bound CFTR in complex with GLPG1837 (stick 
representation, orange). (F) EM density at the potentiator-binding site 
in the potentiator-free (left), ivacaftor-bound (center), and GLPG1837- 
bound (right) reconstructions. Densities corresponding to CFTR are 
shown in gray, whereas densities only observed in the potentiator-bound 
maps are shown as green meshes. All maps are contoured at 9o. 
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Fig. 4. Molecular details of GLPG1837 binding. (A) Schematic drawing 
of the interactions between GLPG1837 (orange) and CFTR. Hydrogen 
bonds are represented by black dashed lines, and hydrophobic interactions 
are shown by the spokes. (B) Stereo view of the GLPG1837 binding 

site. Residues within van der Waals distances (<4.5 A) are shown as 

blue sticks, and hydrogen bonds are depicted as black dashed lines. 

An unknown density between R933 and GLPG1837 is shown in green 
mesh. (C to E) Upper panel: Representative macroscopic current traces 
of WT and mutant CFTR in response to GLPG1837 perfusion. Different 


affinity for ivacaftor (Fig. 2, C to E, and table S82). 
Four residues appear to be most important— 
their alanine substitutions nearly abolishing 
ivacaftor binding (Fig. 2, C and D). Among 
them, S308 and F312 directly coordinate the 
oxoquinoline moiety through a hydrogen bond 
and a n-n stacking interaction, respectively (Fig. 
2, A and B). The other two residues, R933 and 
Y304A, hydrogen bond to main-chain carbonyls 
in the TM 8 hinge, thus stabilizing the overall 
structure of the binding site (Fig. 2, A and B). 
Another important residue, F931, forms an edge- 
to-face interaction with the phenol ring of ivacaftor. 
Mutation of F931 to alanine decreased drug affi- 
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nity by about 10-fold (Fig. 2D and table S2). Mu- 
tating F305, L233, and F236, the three residues 
within van der Waals distance from the oxoqui- 
noline of the drug, also decreased its affinity (Fig. 
2E and table S2). By contrast, substitution of 
F932, which interacts with one of the lipid- 
exposed tert-butyl groups, had no effect (Fig. 2E 
and table S2). These mutational effects are con- 
sistent with the structure of the CFTR-ivacaftor 
complex, indicating that hydrogen bonds are cri- 
tical for ligand recognition in the low-dielectric 
environment of the membrane. 

The structure of CFTR-ivacaftor complex large- 
ly explains the structure activity relationship 
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concentrations of GLPG1837 applied after channel activation are 
marked above the trace. Lower panel: Dose-response curves 

with estimated ECs9 values. CFTR-containing membrane patches 
were fully phosphorylated by PKA in the presence of saturating 
amount of ATP before GLPG1837 titration. The total current at 3 uM 
GLPG1837 was used to normalize the current potentiated by different 
concentrations of GLPG1837. The dose responses were fitted with 
the Hill equation. Each data point represents values determined from 


(SAR) for the series of analogs that led to this 
drug (6). From the published SAR, 48 ivacaftor 
analogs, ranging from the initial high-throughput 
screening hit to optimized leads, may be readily 
docked into the ivacaftor site, making favorable 
interactions (fig. S5). The docked poses super- 
pose with the ivafactor structure, recapitulating 
more and more of the the drug’s interactions 
with CFTR as the molecules are optimized. Key 
interactions common to most of the docked 
complexes include the internal hydrogen bond 
between the conserved side-chain amide and 
the ubiquitous oxoquinoline oxygen, the hydro- 
gen bond between the main-chain nitrogen of 
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F931 and that same amide, and the interaction 
between the oxoquinoline nitrogen and $308. 
Similarly, the stacking observed between F312 
and ivacaftor’s oxoquinoline ring is conserved 
among the analogs. The phenolic hydroxyl, which 
appears late in the affinity maturation and is 
retained in ivacaftor, docks to interact with R933, 
as observed in the ivacaftor complex; addition of 
this group substantially increases affinity, at least 
partly reflecting its new interaction with the 
arginine (fig. S5; a more detailed analysis is pres- 
ented in the supplementary text). These results 
are consistent with the ivafactor structure deter- 
mined here and the SAR observed in its de- 
velopment (6). 

Recently, a new potentiator, GLPG1837, has 
been discovered to have higher efficacy than that 
of ivacaftor (7). We first studied the effects of 
GLPG1837 in a planar bilayer system, where 
detergent-purified CFTR channels were recon- 
stituted into liposomes then fused with a bilayer 
lipid membrane. At saturating ATP concentra- 
tion, the P, of the phosphorylated WT CFTR 
increased from 0.23 to 0.54 upon addition of 
10 uM GLPG1837 (Fig. 3, A and B). The P, of 
E1371Q was also increased by GLPG1837, from 
0.64 to 0.88 (Fig. 3, Aand B). The P, values are 
lower than those measured in cellular mem- 
branes (23, 24), possibly owing to differences in 
their lipid compositions. A competitive binding 
assay shows that GLGP1837 reduced the ap- 
parent affinity of ivacaftor (Fig. 3C); the inhibi- 
tion constant (K;) was determined to be 0.30 + 
0.08 uM (Fig. 3D). 

Next, we determined the cryo-EM structure of 
phosphorylated E1371Q CFTR in complex with 
ATP-Mg”* and GLPG1837 (Fig. 3E, figs. S6 to S8, 
and table $3). The overall structure, at 3.2 A 
resolution, is essentially indistinguishable from 
those of drug-free and ivacaftor-bound forms. 
Although GLPG1837 clearly binds in the same 
pocket, its orientation and shape differ from 
that of the ivacaftor density (Fig. 3F). Here too, 
a model of the CFTR-GLPG1837 complex was 
built by molecular docking, followed by energy 
minimization and refinement of the ligand- 
receptor complex. In the final model, the drug 
fits well into the electron density, with favorable 
polar and nonpolar interactions, and a relatively 
unstrained ligand geometry (Fig. 3F). 

Although the chemical structure of GLPG1837 
differs from that of ivacaftor, residues interacting 
with GLPG1837 largely overlap with those en- 
gaging ivacaftor (Fig. 4, A and B). Specifically, 
$308 and Y304 form hydrogen bonds; and L233, 
F236, F305, A309, F312 form hydrophobic inter- 
actions with the drug (Fig. 4, A and B). Four 
polar groups on GLPG1837 are engaged in in- 
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tramolecular interactions (Fig. 4, A and B), also 
observed in the crystal structure of the com- 
pound itself (7), that shield charges, permitting 
this relatively polar molecule to permeate the 
membrane. 

To evaluate the contribution of the intermo- 
lecular hydrogen bonds, we measured the half- 
maximal effective concentration (EC;,) values of 
GLPG1837 for the WT, Y304A, and S308A CFTR 
(Fig. 4, C to E). The ECso value of the WT CFTR 
was determined to be 0.12 + 0.03 uM (Fig. 4C), 
similar to the reported value of 0.23 + 0.12 uM 
(24). Replacing Y304 or S308 with an alanine 
increased the ECs, values by 57- and 34-fold, 
respectively (Fig. 4, D and E), underscoring the 
importance of the structurally observed hydro- 
gen bonds in GLPG1837 recognition. 

Here, we have described the structures of 
CFTR in complex with two separate potentia- 
tors. These structures allow us to reach several 
conclusions. First, although these two potentia- 
tors are chemically dissimilar, they both bind to 
the same site within the transmembrane region 
of CFTR. This explains why ivacaftor and 
GLPG1837 are competitive in electrophysio- 
logical and binding assays (Fig. 3, C and D) 
(24). Second, because the drug binding site 
coincides with a hinge involved in gating (19, 20), 
we propose that the presence of a drug in the 
pocket stabilizes the open configuration of the 
pore relative to the closed. In electrophysiolog- 
ical experiments, this stabilization is manifested 
as an increased opening rate and a decreased 
closing rate (Fig. 3A) (14, 24). The absence of 
observable protein structural differences be- 
tween the drug-bound and drug-free conforma- 
tions is not surprising given that an open 
probability increase from 0.64 to 0.88 (Fig. 3A) 
corresponds to a drug-induced energy change in 
the closed-open equilibrium (ie., AAG) of just 
over 1 kT (where #’g is the Boltzmann constant 
and T is temperature). Third, the drug-binding 
pocket identified here is likely a hotspot for the 
action of CFTR potentiators. It is now possible 
to use these structures of CFTR bound to two 
different potentiator molecules, together with 
computation, to identify new potentiators. 
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CORAL REEFS 


Demographic dynamics of the 
smallest marine vertebrates fuel 
coral reef ecosystem functioning 


Simon J. Brandl’?*, Luke Tornabene’, Christopher H. R. Goatley*, 
Jordan M. Casey”®”, Renato A. Morais®®, Isabelle M. Cété’, Carole C. Baldwin’®, 
Valeriano Parravicini®’®, Nina M. D. Schiettekatte”’®, David R. Bellwood? 


How coral reefs survive as oases of life in low-productivity oceans has puzzled scientists 
for centuries. The answer may lie in internal nutrient cycling and/or input from the 
pelagic zone. Integrating meta-analysis, field data, and population modeling, we show 
that the ocean’s smallest vertebrates, cryptobenthic reef fishes, promote internal reef 
fish biomass production through extensive larval supply from the pelagic environment. 
Specifically, cryptobenthics account for two-thirds of reef fish larvae in the near-reef 
pelagic zone despite limited adult reproductive outputs. This overwhelming abundance 
of cryptobenthic larvae fuels reef trophodynamics via rapid growth and extreme mortality, 
producing almost 60% of consumed reef fish biomass. Although cryptobenthics are 
often overlooked, their distinctive demographic dynamics may make them a cornerstone 
of ecosystem functioning on modern coral reefs. 


ow coral reefs maintain high diversity 

and productivity in oligotrophic tropical 

oceans—often termed “Darwin’s paradox”— 

remains poorly understood (7-3). Both 

pelagic subsidies (2) and internal energy 
and nutrient cycling (3) have been put forward 
as drivers of productivity on reefs, but their rel- 
ative importance has not been resolved. 

Fishes form the largest reservoir of consumer 
biomass on reefs and are involved in most inter- 
nal energy and nutrient fluxes (4), but they also 
bridge the pelagic zone-reef interface during 
their larval development (5) and may therefore 
subsidize reefs with pelagic productivity (6). 
Over the past two decades, reconstructions of 
larval dispersal pathways (7, 8) have revealed the 
importance of larval dynamics for reef fish ecol- 
ogy, evolution, and conservation (9-17). However, 
the role of larval stages in coral reef ecosystem 
functioning remains virtually unknown. 


Department of Biological Sciences, Simon Fraser University, 
Burnaby, BC V5A 186, Canada. *Tennenbaum Marine 
Observatories Network, Smithsonian Institution, Edgewater, 
MD 21037, USA. School of Aquatic and Fishery Sciences 
and the Burke Museum of Natural History and Culture, 
University of Washington, Seattle, WA 98105, USA. 
‘Function, Evolution and Anatomy Research (FEAR) Lab and 
Palaeoscience Research Centre, School of Environmental and 
Rural Science, University of New England, Armidale 2351, 
Australia. °PSL Université Paris: EPHE-UPVD-CNRS, USR 
3278 CRIOBE, Université de Perpignan, 66860 Perpignan, 
France. Laboratoire d’Excellence “CORAIL,” Perpignan, 
France. Department of Invertebrate Zoology, National 
Museum of Natural History, Smithsonian Institution, 
Washington, DC 20650, USA. °ARC Centre of Excellence for 
Coral Reef Studies, James Cook University, Townsville 4811, 
Australia. °College of Science and Engineering, James Cook 
University, Townsville 4811, Australia. °Department of 
Vertebrate Zoology, National Museum of Natural History, 
Smithsonian Institution, Washington, DC 20650, USA. 
*Corresponding author. Email: simonjbrandl@gmail.com 


Brandl et al., Science 364, 1189-1192 (2019) 


Integrating published surveys of coral reef 
fish larvae, new data on adult reef fish commu- 
nities, and a theoretical population model (72), we 
demonstrate that cryptobenthic reef fishes [a 
group of 17 reef-associated fish families charac- 
terized by species <50 mm in length (72, 13)] play 
a previously unrecognized but critically impor- 
tant role in coral reef ecosystem functioning be- 
cause of their larval dynamics, rapid growth, and 
extreme mortality. 

Cryptobenthic larvae greatly outnumber large 
reef fish larvae near coral reefs (Fig. 1) despite 
limited adult reproductive outputs. As more ga- 
metes produce more larvae, family-specific larval 
abundance predictably increases with gamete 
output for both cryptobenthic and large reef 
fishes [calculated from adult densities, fecundity, 
and spawning frequency (12, 14, 15)]. However, 
the respective slopes differ drastically, with the 
relationship for cryptobenthics being more than 
an order of magnitude steeper than that for large 
reef fishes (Fig. 2 and table S1), and this differ- 
ence is consistent across the locations studied 
(Australia, Belize, and French Polynesia). We 
found no statistical evidence for effects of pelagic 
larval duration (i.e., the time that larvae spend 
in the plankton) or broadcast (i-e., eggs released 
into the water column) versus benthic clutch 
spawning (i.e., distinct clutches of guarded eggs) 
on larval supply (table $2) despite the high prev- 
alence of benthic clutch spawning in crypto- 
benthics. Furthermore, the difference in slopes 
was not affected by the uncertainty in the input 
data, including increases in spawning frequency 
for cryptobenthics (fig. S2 and tables S3 and S4) 
and uncertainties in the body mass-fecundity 
relationship, adult densities, and larval compo- 
sition (figs. S3 to S5). Thus, cryptobenthics are 
disproportionally more successful at converting 
adult gametes into larval supply, which likely 
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underpins their extreme abundance in the near- 
shore ichthyoplankton. 

The abundance of cryptobenthic larvae, de- 
spite limited gamete output, may provide the 
continuous (J6) and copious inflow of larvae that 
is assumed necessary for adult population main- 
tenance in small, short-lived reef fishes (77). Our 
results show that this, in turn, forms the basis of 
a critical energy and nutrient pump that oper- 
ates across the pelagic zone-reef interface and 
may help to explain the enigmatic productivity 
of coral reef ecosystems (Fig. 3). Using a demo- 
graphic model that simulates the daily arrival, 
growth, and death of larval recruits from all reef 
fish families over 1 year (fig. S6), we estimate that 
juvenile and adult cryptobenthics provide most 
(57.5 + 0.1% SE) of the consumed fish biomass 
on reefs. However, because of extreme mortality 
rates and small sizes, cryptobenthics appear to 
make negligible contributions to net productivity 
and standing fish biomass (Fig. 3C), which mir- 
rors empirical evidence (18). Standing biomass 
is the most commonly quantified metric of eco- 
system functioning on reefs (19); however, it 
does not capture the rapid turnover in crypto- 
benthic populations (693.1 + 2.7% SE annually) 
that is enabled by their extraordinary demo- 
graphic dynamics. Thus, cryptobenthics repre- 
sent the “dark productivity” of coral reefs, which 
fuels reef fish biomass production but is rarely 
perceived because it is consumed almost as quickly 
as it is produced. 

This role of cryptobenthics is empirically re- 
flected in their extreme mortality [up to 70% per 
week (20, 27)] and their consumption by virtually 
any predator capable of eating them (2/, 22). Al- 
though the community-wide representation of 
cryptobenthics in fish diets frequently appears 
lower than the ~58% identified herein (23), their 
true contribution to coral reef trophodynamics 
may be obscured by (i) rapid digestion, preclud- 
ing reliable visual identification (24); (ii) preda- 
tion on cryptobenthics by invertebrates (22), 
which are fed on by larger fishes; and (iii) pre- 
dation on cryptobenthics by juvenile predatory 
fishes [e.g., cryptobenthics comprise up to 88.6% 
of fish prey for juvenile groupers (25, 26)], which 
are rarely included in community-wide dietary 
analyses. 

The key to the unique demographic dynamics 
of cryptobenthics and their productivity might 
be a shift away from long-range dispersal toward 
retention of larvae in the immediate vicinity of 
natal (home) reefs. Four lines of evidence, along 
with our findings, support this hypothesis. First, 
larval dispersal models show that larval supply 
easily maintains adult populations in large-bodied, 
long-lived reef fishes (7). Conversely, small-bodied, 
short-lived taxa appear unable to sustain local 
populations even when active swimming by larvae 
is considered (7), yet cryptobenthic populations 
persist. Near-complete retention of cryptobenthic 
larvae close to natal reefs may solve this paradox. 
Second, driven by olfactory and auditory cues, 
cryptobenthic larvae show stronger natal homing 
than similarly sized large reef fishes (27) and can 
have very short dispersal distances (28). Third, 
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Fig. 1. Global dominance of cryptobenthic reef fishes in the near-reef 
ichthyoplankton. (A) Cryptobenthic larvae (blue) account for two- 
thirds (65.7%) of the larval reef fish pool <10 km from reefs, whereas 
large reef fish larvae (light gray) dominate >10 km from reefs. Crossbars 
represent predicted medians (+95% credible intervals) from a Bayesian 
beta-regression model; circles represent the raw data (12). (B) The 

high proportion of cryptobenthics (colored pie slices) in the near-reef 
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ichthyoplankton is consistent across major biogeographic coral reef 
regions. Dots represent separate studies. (C) Average contribution of 
reef fish taxa to global near-reef ichthyoplankton. The three highest 
contributing taxa are cryptobenthic, represented by photographs of 
adult Eviota infulata (Gobiidae) (D), Scartella cristata (Blenniiformes— 
Blenniidae) (E), and Cheilodipterus quinquelineatus (Apogonidae) 

(F). Families contributing <O.1% were omitted for clarity. 
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Fig. 3. Ecosystem-scale effects 
of the demographic dynamics 
of cryptobenthic reef fishes. 

(A) Cryptobenthics far outnumber 
large reef fishes in cohorts of larvae 
that recruit to reefs. (B) At settle- 
ment, large reef fish recruits are, 
on average, slightly larger than 
cryptobenthics. (C) Cryptobenthics 
contribute little (~13%) to net 
biomass production but produce 
almost 60% of consumed reef fish 
biomass because of exceptionally 
high turnover. (D) Standing stock 
biomass of large reef fishes far 
outweighs that of cryptobenthics, 
although adult abundances are 
approximately even. (E) Despite 
higher gamete output from 

large reef fishes, cryptobenthic 
larvae dominate the near-reef 
ichthyoplankton. This restarts the 
“crypto-pump” through rapid 
replenishment of consumed 
individuals. Uncertainty estimates 
in (C) and (D) are based on 

100 iterations of the full model. 


Fig. 4. Phylogenetic positioning and species richness in cryptobenthic 
and large reef fishes. (A) Cryptobenthic reef fishes (blue) have few 
independent origins but (B) account for almost half of all reef fish 
species (2799 species). Black, reef fish taxa; gray, non—-reef fish taxa 
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cryptobenthic larvae have limited yolk sacs and 
ingest prey immediately after hatching, indi- 
cating dependence on resource-rich, near-reef 
environments (8, 29). Finally, remaining close 
to natal reefs during development should result 
in fine-scale genetic structuring. With few ex- 
ceptions (30), this is observed in cryptobenthic 
fishes (10, 27, 31). Collectively, this suggests 
that the “pelagic” larvae of most cryptobenthics 
remain close to their natal reefs (32), resulting 
in prodigious near-reef abundances and a unique 
role of cryptobenthics in coral reef ecosystem 
functioning. 

Larval retention may lead to two evolutionary 
consequences: (i) rapid speciation through micro- 
allopatry arising from restricted gene flow among 
populations and frequent reproductive isolation 
0, 33, 34) and (ii) a higher risk of extinction 
than commonly assumed for small marine fishes 
(35) because populations can easily become ephe- 
meral and disappear after stochastic environ- 
mental changes (36). If these processes scale up 
to macroevolutionary levels, then cryptobenthics 
should be phylogenetically rare but species rich 
and this is indeed the case. Few reef fish families 
have successfully adopted cryptobenthic lifestyles 
and major diversification events are restricted 
to two lineages (Fig. 4): the larger Blenniiformes 
and the Gobiaria (gobies and apogonids). 
Nevertheless, these lineages are among the most 
rapidly diversifying clades of actinopterygian 
fishes (37) and cryptobenthics collectively ac- 
count for almost half (44.5%) of total reef fish 
biodiversity (Fig. 4) (13). 

In summary, through their extraordinary larval 
dynamics, rapid growth, and extreme mortality, 
the hyperdiverse consortium of abundant, tiny, 
and short-lived cryptobenthic species appears to 
be a critical functional group on coral reefs. The 
“dark productivity” provided by cryptobenthics 
underpins reef fish biomass production and 
supports the characteristic fast-paced dynam- 
ics of modern coral reefs. 
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FISHERIES 


The small world of global marine 
fisheries: The cross-boundary 
consequences of larval dispersal 


Nandini Ramesh’, James A. Rising”, Kimberly L. Oremus” 


Fish stocks are managed within national boundaries and by regional organizations, but 
the interdependence of stocks between these jurisdictions, especially as a result of 
larval dispersal, remains poorly explored. We examined the international connectivity of 
747 commercially fished taxonomic groups by building a global network of fish larval 
dispersal. We found that the world’s fisheries are highly interconnected, forming a 
small-world network, emphasizing the need for international cooperation. We 

quantify each country’s dependence on its neighbors in terms of landed value, food 
security, and jobs. We estimate that more than $10 billion in annual catch from 2005 
to 2014 is attributable to these international flows of larvae. The economic risks 
associated with these dependencies is greatest in the tropics. 


Larval connectivity patterns have been ana- 
lyzed at both the regional (6, 9-12) and global 
(4, 13, 14) levels and have been used to sug- 
gest changes for spatial management and 
conservation (72, 15). However, the impact 


arine fisheries supply food and liveli- 
hoods to millions of people around the 
world (7). Though fisheries are typically 
managed at the scale of national exclu- 
sive economic zones (EEZs), many fish 


populations are connected beyond EEZ bound- 
aries (2-6). Whereas pelagic species can be 
tracked across international borders as adults 
(7), nonpelagic populations connect primarily 
via the dispersal of fish eggs and larvae, forms 
that cannot yet swim by ocean currents (2, 8). 
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on fisheries of larval connectivity across EEZs 
is not well understood, even though more 
than 90% of the world’s fish are caught with- 
in EEZs (16). 

On the scale of a single species or region, 
this connectivity can be analyzed empirically 
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through genetic testing (9, 10). For analyses on 
larger scales, dispersal patterns can be estimated 
using biophysical models that combine oceano- 
graphic data with an understanding of the bi- 
ology of the stocks (4, 14, 17). Such efforts can be 
challenging, because species vary widely in larval 
timing and duration and currents vary with the 
seasons; therefore, generalizations can be mis- 
leading. More realistic inputs can be achieved 
by using life history traits for each species, in- 
cluding time and place of spawning and larval 
duration. Sensitivity analyses can help to en- 
sure that results are robust to changes in key 
assumptions (14), while empirical bounding can 
safeguard against predicting unrealistic disper- 
sion outcomes (6). 

Network analysis has previously been applied 
to marine systems to describe the connectivity of 
plankton communities (78), local fishing com- 
munities (19, 20), and marine reserves (14). 
Networks of larval flows have been used to 
identify “hub” subpopulations for protection 
on a regional scale (72). 

In this study, we combined oceanographic and 
life history data for 706 species and 434 genera 
of commercially harvested fish to estimate their 
connectivity across 249 EEZs and constructed 
a network representing the larval flows between 
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Fig. 1. The network of spawn-attributed catch flows between EEZs. Each EEZ is a node (circle) of the network and its color represents its network 
community. The connectors or edges in this network flow clockwise from source to sink, with their thicknesses representing the magnitude of the net flow 
of caught biomass between the EEZs. Only the edges in the upper tercile of edge weights are shown, for clarity (see SM 3.2 for the full network). The size 
of each node represents its out-degree, i.e., the number of other EEZs for which it acts as a source of fish larvae, including connections not shown in this image. 
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nations. Nations that depend heavily upon their 
neighbors for recruitment risk losing part of 
their catch if the fisheries in the source EEZs 
outside their jurisdiction are poorly managed. 
We quantified these risks in economic terms 
and identified regional “hotspots” of risk for 
catch, fishery employment, and food security. 

We used a particle-tracking system (27) with 
time-varying ocean currents (22) and species- 
specific life histories (23) to simulate the dis- 
persal of eggs and larvae through a dynamic 
ocean. We placed multiple simulated particles 
for each species based on the timing and loca- 
tion of that species’ spawning and let them drift 
for their larval duration to obtain a probabilistic 
estimate of species-specific larval trajectories. 
We used a random-walk parameterization (27) 
that adds a small velocity at every time step to 
account for turbulent motion at small scales 
[see section 3.1.2 in the supplementary mate- 
rials (SM 3.1.2)]. 

We empirically bounded our results by dis- 
carding particles that arrived in regions where 
the species is not present in observed catch 
data (16). For a given EEZ, catch is attributed 
based on the proportion of particles arriving 
there from each spawning country (see SM 1.1). 
This proportionality forms the core assumption 
of our model. We tested our main results with a 
series of analyses of sensitivity to this assumption. 
These included reducing spawn floating dura- 
tion to account for uncertainties in spawning 
mortality (2, 24), introducing return adult spawn- 
ing migration (25) (see SM 3.6), and distinguish- 
ing different levels of recruitment limitation. 

We estimated how much of each country’s ob- 
served catch comes from its neighbors by con- 
structing for each species a transition matrix that 
describes the probability of its offspring dispers- 
ing from one EEZ to another. This transfer of 
biomass between nations’ EEZs is represented 
as a network in Fig. 1. 

Each connector of the network represents 
net flows of fish from one country to another. 
Countries that depend on inflows of juvenile 
fish to maintain their local populations require 
international cooperation to ensure sustainable 
fisheries. Our analysis of these flows revealed 
that a large proportion of marine fisheries within 
EEZs form a single, global network (Fig. 1). 

We found that the global network of marine 
fisheries is a scale-free, small-world network. 
The scale-free network property, common in 
natural systems (26), is characterized by an ex- 
ponential distribution of the number of con- 
nections from each node (see SM 3.2). This 
exponential degree distribution results in a 
“hub-and-spoke” structure that is resilient to 
random disturbances because of the large num- 
ber of less-connected countries from which dis- 
turbances do not easily propagate to other parts 
of the network. However, a disturbance to any 
of the highly connected hubs in a scale-free net- 
work can affect numerous surrounding nodes. 
In this context, habitat destruction, overfishing, 
or environmental change in a hub EEZ could 
have impacts that spread beyond its own bound- 
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Fig. 2. Regional currents and community networks. (A to D) The speed (shown in colors, in 
centimeters per second) and direction (arrows) of ocean surface currents in four regions with 
interconnected fisheries (West Africa, Baltic Sea, the Caribbean, and Western Pacific) during 
the month of maximum spawning activity in each (August, May, June, and May, respectively). 

(E to H) The corresponding subset of the global network encompassed by the four regions. 
Colors, node sizing, and connector directions are as for Fig. 1. Nodes are arranged to 
approximately correspond to geographic locations of the EEZs. 


aries. Conversely, targeted efforts to manage 
fisheries within these hub EEZs could benefit 
many nations. 

To demonstrate the relationship between cur- 
rents and the network of larval dispersal, we 
focused more closely on four regions (Fig. 2). 
The differences between the regional networks 
and average current speed arise from the details 
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of current speeds during spawning, larval du- 
ration, and empirical observations of species 
presence or catch. The influence of the Guinea 
Current on the connectivity of West Africa’s fish- 
eries can be seen in the large number of EEZs 
that act as sources to their eastward neighbors, 
especially between Guinea-Bissau and Nigeria. 
While the strongest connections are typically 
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Fig. 3. Countries with highest outflowing and inflowing catches. 
(Left) Top 20 countries sorted by total outflowing catch (in thousands of 
tons) and value [in U.S. dollars (USD)] at risk. (Right) Top 20 countries 
sorted by total inflow of catch (thousands of tons) and value (USD) at risk. 
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Fig. 4. Hotspot map showing fishing dependency on spawning 
grounds in neighboring waters, by country. Countries are shaded 

by catch (in thousands of metric tons) at risk, with darker shades 
representing higher catches. Icons depict EEZs that are the most 
dependent on their neighbors. The catch icon indicates that more than 


between adjacent EEZs, many connections also 
extend over longer distances. In contrast, the 
Baltic Sea has substantially weaker currents. 
There, the largest outward flows originate from 
Sweden and Norway, which have the region’s 
longest coastlines. In the Caribbean, the North 
Brazil Current flows northwestward along the 
South American coast, and consequently many 
of the EEZs lying along this current act as 
sources for the Lesser Antilles. Within the Lesser 
Antilles, the density of small EEZs gives rise to a 
highly interconnected, complex network struc- 
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ture. The effect of the northward flow along this 
island chain can be inferred from the larger node 
sizes among the EEZs lying in its southern por- 
tion. In the Western Pacific, strong currents 
dominate in the equatorial ocean, with weaker 
currents at higher latitudes. The large areas en- 
compassed by this region’s EEZs mean that, un- 
like the other regions, most connections are 
between immediate neighbors. 

The small-world property implies that it is 
possible to traverse the global network in a small 
number of steps, on average. Within this net- 
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Maldives 


For both catch and landed values, data from 2005 to 2014 were used 
and attributed to larvae, by species. Resilience levels represent the 
estimated decline a population can endure without being considered 
vulnerable to local extinction. 
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30% of a country’s catch value is dependent on neighboring spawning 
grounds, the GDP icon represents a risk to more than 0.8% of its 
GDP, the labor icon represents that more than 1.5% of its jobs are 
vulnerable, and the food security icon represents a food security 
dependence index >1.1%. 


work, there exist smaller clusters or communi- 
ties that are tightly connected. Most of these 
clusters internally exhibit the small-world prop- 
erty. In theory, this property of the global fish- 
eries network suggests that disturbances to a 
large hub could propagate via cascading effects 
on the surrounding spokes. 

A key question is whether disruptions to a 
given EEZ actually propagate in this manner. A 
stock’s response to external shocks depends on 
both its population dynamics and mortality from 
fishing, which can be affected by management 
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(27). Some fish stocks are biologically capable 
of replenishing themselves when their numbers 
dwindle, provided that fishing pressure is re- 
lieved, reducing the likelihood that disturbances 
will propagate. However, “recruitment-limited” 
stocks are vulnerable to a decline in spawning 
population, making it more likely that distur- 
bances will spread across the network even if 
the receiving fisheries are managed. We adopted 
FishBase’s classification of stock resilience 
as a proxy for this type of density dependence. 
For high-resilience stocks, which are generally 
not recruitment limited, our measure of stock 
dependence overestimates the extent to which 
stocks will be reduced if recruitment inflows fail. 
For those stocks classified as having medium or 
low resilience, however, we found a strong cor- 
relation between our simulation’s predictions 
and observed variances in stock levels (see SM 
3.5). Even for countries whose fisheries mostly 
comprise non-density-dependent stocks, these 
larval inflows serve as a buffer against fishery 
collapse within their waters. 

To contextualize our results, we estimated the 
economic significance of the network’s interna- 
tional connections. First, we considered the 
amount and value of catch that flows in and out 
of each EEZ (Fig. 3). Japan, China, and Alaska 
are responsible for the greatest outflows, reflect- 
ing their productive waters. However, having 
fewer neighbors makes them smaller hubs 
(Fig. 1). Indonesia has the most landed value at- 
tributable to other countries, due to its high- 
value catch and many neighbors. The countries 
with the greatest catch inflows are generally 
those with the largest fisheries. 

Next, we identified nations that are potentially 
most vulnerable, in socioeconomic terms, to 
the management of neighboring waters (see 
table S5). In Fig. 4, we highlight countries that 
depend the most on the spawning grounds of 
neighbors in terms of their total catch, gross 
domestic product (GDP), number of jobs in the 
fishery industry, and a fishery food security de- 
pendence index (28). The most vulnerable nations 
are concentrated in the hotspot regions of the 
Caribbean, West Africa, Northern Europe, and 
Oceania. The risks to national GDP and labor 
force are generally highest in the tropics. How- 
ever, our measure of food security risk also 
identified a few European nations. 
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Our analysis showed that about $10 billion 
worth of annual marine catch may rely on 
transnational exchanges of fish offspring. These 
dependencies form a single global network, in- 
dicating that marine fisheries, even within na- 
tional boundaries, constitute an interconnected, 
globally shared resource. 

This network’s scale-free and small-world prop- 
erties imply that fish stocks from a small num- 
ber of EEZs provide benefits to a large number of 
“downstream” countries. The most vulnerable 
nations are clustered in a few hotspot regions 
(Fig. 4). This pattern lends further support to the 
use of international frameworks, such as large 
marine ecosystems and marine protected area 
networks (29, 30). 

Further research is needed to understand how 
small-scale coastal processes, larval behavior, and 
fisheries management affect this connectivity. 
Beyond the spawning connections studied 
here, national fisheries are interdependent through 
the movement of adult fish, population shifts 
under climate change, and international fishing 
treaties. In particular, the role of adult fish mi- 
gration in driving international connectivity re- 
mains an important question. While a more 
detailed analysis is required to accurately de- 
scribe dispersal pathways of individual species, 
this study highlights the role of larval connec- 
tivity across international boundaries and the 
need for multilateral cooperation for sustain- 
able management of these shared resources. 
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new products 


Wide-Mouth Bottle and Cap System 
Designed to help scientists accommodate 
sample quantities from 250 mL to 20 L, the 
Media Bottle and Multiport Cap System 
offered by DWK Life Sciences is used for 
biological and chemical reactions, plant cell 
culture, photobiological studies, and sample 
collection and storage. The wide neck simpli- 
fies transfer of samples, including powders, 
pastes, granules, and liquids. Spoons, spatu- 
las, tweezers, and large funnels are easily ma- 


a nipulated through the opening. Made from 33 
expansion, low extractable borosilicate glass 


in conformance to U.S. Pharmacopeia Type | standards, the bottle is 
resistant to acids, corrosive chemicals, and thermal shock, withstand- 
ing temperatures as high as 500°C. When combined with the multiport 
cap, the bottle is suitable for dilutions, fermentation, and culturing. 

It is available in clear glass, an amber glass coating for light-sensitive 
materials, or with KimCote, a safety coating for added protection in the 
event of breakage. 

DWK Life Sciences 

For info: 800-225-1437 

www.dwk.com/bioreactor 


High-Yield Protein Production in CHO Cells 

CHOgro High Yield Expression System from Mirus Bio offers a rapid 
means for developing protein-based biotherapeutics within a single-host 
cell type, with titers that exceed all other commercially available tran- 
sient expression systems. Transient gene expression has been gaining 
popularity as a means for producing antibodies and other recombinant 
proteins in mammalian cells. As the demand for recombinant proteins in 
drug discovery continues to grow, and with the majority of therapeutic 
antibodies being produced in Chinese hamster ovary (CHO) cells, there 
is aneed for a fast, reliable, and robust transient expression platform. 
CHOgro High Yield Expression System consists of CHOgro Expression 
Medium, L-glutamine and poloxamer 188 medium supplements, CHOgro 
Complex Formation Solution, TransIT-PRO Transfection Reagent, and 
CHOgro Titer Enhancer. This combination streamlines the protein ex- 
pression process—high-antibody titers can be achieved in seven days. 
Mirus Bio 

For info: 888-530-0801 

www.mirusbio.com 


Automated Colony Counting System 

AutoCOL from Synbiosis is an integrated system that allows microbiolo- 
gists to load plates and count colonies on up to 100 plates in 

30 min—ideal for improving throughput and accuracy in highly regulated 
laboratories. The plate feeder automatically identifies plates using a 
barcode reader and counts colonies on any agar plate. With AutoCOL, 
microbiologists can set pass/fail limits and the system will track and stack 
pass/fail plates in specific positions after reading them, ensuring 
accurate, walk-away counts for microbial applications, such as 
environmental monitoring, quality control labs, bioburden testing, and 
water analysis. 

Synbiosis 

For info: 800-686-4451 

www.synbiosis.com 
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Automated NGS Library Prep 

Free yourself from manual NGS library prep even on low-throughput 
experiments, with the epMotion 5073m NGS solution from Eppendorf. 
Until now, automated construction of next-generation sequencing (NGS) 
libraries has only been economical for preparations of at least 96 samples. 
Smaller sample numbers had to be processed manually using pipettes, 
which is timely and can incorporate pipetting errors. epMotion can 
process up to 24 samples simultaneously in a fully automated fashion. 
Smooth operation is guaranteed by ample space for pipette tips, three 
required single- and multichannel pipetting tools with a range of 

5 uL-300 uL, and a special rack that accommodates all required solutions. 
The optimized epBlue software enables smart use of the single- and multi- 
channel dispensing tools to optimize pipetting steps and tip use, even in 
the case of odd sample numbers. The system saves time and money, and 
reduces waste—all on a footprint of only six letter-sized pieces of paper. 
Eppendorf 

For info: 800-645-3050 

www.eppendorf.com/ngs-compact 


Microbioreactor System 

The ambr 15 cell culture Generation 2 from Sartorius Stedim Biotech 
replicates laboratory-scale bioreactor performance at the microscale (10 
mL-15 mL) and controls up to 48 single-use bioreactor cultures in parallel. 
The system offers increased flexibility and expanded capability for clone 
selection, media and feed optimization, and early process development 
work. It also provides a robust screening platform for development of cell- 
and gene-therapy processes, including HEK293 for viral vector production, 
T cells, induced pluripotent stem cells, and other immune-derived cell 
lines. The ambr clone selection software, included as a one-year license, 
provides simplified, streamlined multivariate data analysis for faster, 
more consistent cell-line screening and ranking. Adaptation of cell lines to 
different media for media screening studies can be performed in ambr 15 
microbioreactors using new culture-passage steps and rapid vessel-drain 
functionality. New media-mixing steps automate the creation of media 
blends, eliminating the need to premix, saving time, and reducing errors. 
Sartorius Stedim Biotech 

For info: +49-(0)-551-308-0 

www.sartorius.com 


LC/MS and GC/MS Sample Preparation 

Together with sister company J.G. Finneran Associates, Porvair Sciences 
offers an unmatched selection of innovative glass vials, seals, closures, 
and microplates to enhance liquid and gas chromatography (LC/GC) 

and mass spectrometry (MS) workflows. In addition, MS-certified kits for 
sample preparation and use in autosamplers are available to simplify and 
save time. The highly acclaimed P3 (Protein Precipitation Plate) sets the 
standard for efficiently and economically removing proteins from biologi- 
cal samples that can interfere with downstream LC/MS and GC/MS ap- 
plications. For over 20 years, Porvair Sciences’ Microlute 96-well SPE plates 
have been helping scientists increase the throughput of their MS analyses 
by speeding up sample preparation. The complete Microlute system 
provides a matched filter plate, vacuum manifold, and choice of collection 
plates with the option of a dedicated sample concentrator. In independ- 
ent comparative testing, Porvair Sciences’ deep-well plates have been 
repeatedly shown to contain near-zero contaminants, ensuring protection 
against leaching and interference with your LC/MS or GC/MS analyses. 
Porvair Sciences 

For info: 800-552-3696 
www.microplates.com/resources/porvair-chromatography-catalogue 
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Multiple Faculty Positions Open in RNA Biosciences 


Fo | | OW u S fo r j 0) b Ss The Center for RNA Biomedicine, recently becoming a pioneering part of the $150M University of Michigan 
U Biosciences Initiative, solicits applications for faculty positions at the assistant professor level. The faculty 
Ca ree r advice positions will be on the tenure track with university year appointments starting Sep. 1, 2020, or Jan. 1, 2021. 


Candidates must have the following qualifications: 
a n d m re) re | eA PhD, MD, or other terminal degree 

. *Evidence of superlative scientific accomplishment and scholarly promise 
*Depending on field, evidence of teaching excellence 


Primary school and departmental affiliation(s) will be determined by the applicant’s qualifications and 
preferences, and by relevance of the applicant’s research program to departmental initiatives and themes. 
@ScienceCareers We welcome applications from outstanding scientists in any area of RNA research complementary to 
existing expertise at Michigan, with particular emphasis on RNA drug targeting or as medicine, structural 
biology of RNA nanomachines, RNA structural in vivo profiling, RNA protein interaction profiling, and 
in vivo analysis of long non-coding RNA function. For further information about the Center for RNA 
Biomedicine’s current research areas, please visit rna.umich.edu. 


/ScienceCareers All applications must be submitted online at rna.lsa.umich.edu/facRecruiting/. You will be asked 
to upload the following materials: a cover letter, a curriculum vitae, a brief summary of recent research 
accomplishments and statement of future research plans, and a statement of teaching interests and philosophy. 
Candidates should also provide names and contact information for at least three references, as instructed in 
the online application form. To ensure full consideration, all materials should be received by Sep. 15, 2019. 
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supportive of the needs of dual career couples and is an Equal Opportunity/Affirmative Action Employer. 
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Faculty Position in 
Cardiovascular Sciences 


Virginia Tech (https://vt.edu/) is recruiting to fill tenure-track open rank faculty positions in 
cardiovascular sciences. The newly recruited faculty members will be part of a major initiative to 
expand the existing cardiovascular biomedical research team at the Fralin Biomedical Research 
Institute (FBRI) at Virginia Tech Carilion (https://research.vtc.vt.edu/) located on the Health 
Sciences and Technology campus in Roanoke, Virginia. 


The research institute, led by Dr. Michael J. Friedlander and located in a 100,000 purpose 
designed facility is entering its tenth year with over 25 research teams, 350 employees and 
students with over $125M in extramural research funding. A new 140,000 SF research building 
representing a $90M investment from the state, Virginia Tech and Carilion Clinic and major 
additional investments from the university including a transformative $50M gift to the research 
institute, will open in spring of 2020. The new building will house state-of-the art labs and core 
facilities including those for in vitro, laboratory animal model, human subject and computational 
based research programs - high field (9.4T) MRI, micro PET, surgical research suites, metabolic 
chambers, scanning block face EM, full molecular biology, cellular imaging, histology, data 
analytics and research participant monitoring core facilities. Recruitment of outstanding 
faculty in cardiovascular sciences will build on existing research strengths in cardiomyocyte 
coupling, drug development, sudden cardiac death, arrhythmogenesis and vascular remodeling 
and regulation. Faculty will be embedded in a transdisciplinary research environment that will 
include additional new teams of investigators focused on metabolism and obesity, infectious 
disease/immunology, neuroscience and body-device interfaces and comparative oncology. 


Applicants for all positions should have an earned Doctorate (PhD, MD, DVM or combined 
degrees), postdoctoral research experience in cardiovascular science, a demonstrated record of 
accomplishment of substantial research productivity through major publications and either active 
major extramural funding (preferably from NIH) or strong evidence of the likelihood of obtaining 
such funding in the near term. The new faculty will join the FRBRI’s highly successful and nationally 
regarded Center for Heart and Reparative Medicine Research (CHRMR) led by Dr. Robert Gourdie 
(https://research.vtc.vt.edu/heart_center/). Collaborations will be encouraged within CHRMR, 
with the other groups at the FBRI and across Virginia Tech and with clinical colleagues at the 


adjacent Carilion Clinic, including the division of cardiology under the direction of Dr. David Sane. 


Successful candidates will be expected to establish and maintain a high-quality, extramurally- 
funded research program and participate in teaching and mentoring of graduate students as well as 
mentoring of undergraduate and medical students. Highly competitive salary, space, and start-up 
packages will be provided. 


Faculty members can draw from an outstanding and diverse pool of students to work with them in 
their research program. Specifically, Virginia Tech has over 6,000 graduate students with graduate 
programs in multiple biomedical disciplines including Translational Biology, Medicine, and 
Heath (http://www.tbmh.vt.edu), Molecular and Cellular Biology (https://www.meb.vt.edu), 
Biomedical Engineering and Mechanics (https://beam.vt.edu/graduate.html) and Biomedical 
Sciences and Pathobiology (https://www.vetmed.vt.edu/academics/bmvs/). In addition, the 
Virginia Tech Carilion School of Medicine (https://medicine.vtc.vt.edu/) is one of the nation’s 
most research intensive allopathic medical schools where every medical student carries out a four 
year hypothesis-driven research project under faculty mentorship. 


The FBRI at VTC is located in Roanoke, Virginia and is a public-private partnership between 
the Commonwealth of Virginia’s leading research-intensive university — Virginia Tech — and a 
major health care system — Carilion Clinic in Roanoke, VA. The research institute is located in the 
picturesque Roanoke Valley midway between Washington, D.C. and Charlotte, NC. 


Applications will be reviewed continuously until the position is filled. It is expected that initial 
interviews will begin in mid-summer of 2019. To apply, please submit your application including 
curriculum vitae, statement of research accomplishments and future research plan, and statement 
of teaching/mentoring philosophy at www.jobs.vt.edu, posting # TR0190068. Also, have at least 
three references post their letters of support to the same site. 


Virginia Tech recognizes the critical importance of diverse teams of scholars. It seeks to diversify 
its faculty along multiple dimensions and encourages applications from under-represented 
backgrounds. Virginia Tech is a public global land-grant university, committed to research, teaching 
and learning, and outreach to the Commonwealth of Virginia, the nation, and the world. Building on 
its motto of Ut Prosim (that I may serve), Virginia Tech is dedicated to Inclusive VT- https://www. 
inclusive.vt.edu/ serving in the spirit of community, diversity, and excellence. We seek candidates 
who adopt and practice the Principles of Community, which are fundamental to our on-going efforts 
to increase access and inclusion and to create a community that nurtures learning and growth for 
all of its members. 


Virginia Tech is an Equal Opportunity Employer. 


For inquiries regarding non-discrimination policies, contact the Office of Equity and Access 
at 540-231-2010 or Virginia Tech, North End Center, Suite 2300 (0318), 300 Turner St. NW, 
Blacksburg, VA 24061. 
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The Universidade NOVA de Lisboa invites applications for a full time 
position as Director of the Instituto de Higiene e Medicina Tropical 
(IHMT NOVA) 


The position of Director of the Instituto de Higiene e Medicina Tropical 
(IHMT NOVA) of Universidade NOVA de Lisboa is open for applications. 


The Director, whether of Portuguese or other nationality, must have a 
doctorate degree (PhD). Applications for Director must be from Full 
Professors or Research Coordinators from the Universidade Nova de 
Lisboa or from other teaching universities and research institutions, 
national or international. The Director must have scientific seniority in 
areas of interest to the mission of the IHMT NOVA (post-graduation 
education, scientific research, internationalization and cooperation for 
development) and relevant management experience. A good knowledge 
of spoken and written Portuguese and English languages is required. 


The Director must have initiative and strategic vision, as well as capacity 
to implement the statutory mission of the IHMT NOVA, promoting 
national and international contacts, and representing the IHMT in the 
Universidade NOVA de Lisboa and in the national and international 
contexts. 


The Director is also the main responsible for fundraising and promotion 
of the IHMT NOVA external image. 


Applications will be treated confidentially and must be sent before 
12 pm on the 15th of July, 2019 to eleicaodiretor2019@ihmt.unl.pt 


Other information may be found in the vacancy notice published at the 
IHMT site: https://www.ihmt.unl.pt/concurso-para-diretor-do-instituto- 
de-higiene-e-medicina-tropical-da-universidade-nova-de-lisboa/ 


Additional information can be requested to the Electoral Commission 
via the e-mail eleicaodiretor2019@ihmt.unl.pt 
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of which ones best fit your skills and interests. 
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Mission 

The Burroughs Wellcome Fund is an independent private 
foundation dedicated to advancing the biomedical 
sciences by supporting research and other scientific and 
educational activities. 


The Burroughs Wellcome provides approximately 
$40 million in annual funding across the biomedical 
research landscape and in STEM education in the Fund’s 
home state of North Carolina. BWF believes that a 
diverse scientific workforce is essential to the process 
and advancement of research innovation, academic 
discovery, and public service. 


The Opportunity 

The Burroughs Wellcome Fund seeks an innovative 
and creative leader dedicated to advancing biomedical 
science, promoting career development for biomedical 
researchers, developing leaders in the research enterprise, 
and championing STEM education. 


The President is responsible for developing, executing, 
and evaluating BWF competitive award programs and 
other program-related activities in targeted areas of 
biomedical research and STEM education. In particular, 
she or he will provide the visionary leadership for the 
Fund’s grantmaking strategy. The President will also 
promote the mission of the Fund by engaging with 
various networks, audiences, and communication 
channels. Further information on the position can be 
found in the President’s job description. 


To assure full consideration, applicants must submit by 
July 15, 2019, a curriculum vitae, a statement of interest 
(maximum of two pages) that includes one’s vision 
for directions for the Fund, and the name and contact 
information for three references. Interested candidates 
should submit these materials electronically to Barbara 
Evans at bevans@bwfund.org. 


Questions regarding the details of this position should 
be directed to Russ Campbell, Secretary of the Board 
of Directors of the Burroughs Wellcome Fund, at 
rcampbell@bwfund.org. 


BWF is an Equal Opportunity Employer and 
encourages women and candidates underrepresented 


groups to apply. 


See www.bwfund.org for full details. 
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By Pedro Resende 
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Learning how to pivot 


fter 5 years as a postdoc and co-principal investigator in an academic research lab, I am about 
to start my own company. It’s a big step, and I’m excited—and a little anxious—about how it 
will work out. I am very aware that things might not go the way I envision. But pivoting does 
not feel as intimidating as it could—because I’ve done it before, starting early in my career. 
In doing so, I’ve learned that career changes should not be feared. Instead, they are valuable 
opportunities for professional development and growth. 


When I finished my under- 
graduate training 14 years ago, I 
wanted to continue in academic 
research. But none of my appli- 
cations for Ph.D. programs was 
successful. I wondered whether 
spending time in an industry job 
might help me stand out. At the 
same time, I worried that going 
to industry would compromise 
my chances of ultimately pur- 
suing an academic career. Col- 
leagues and professors told me 
that a move to industry would 
be a path of no return, and that 
big career changes could be 
rough. It felt like a momentous 
decision that would set the tone 
for the rest of my career. 

Nonetheless, I decided to take 
the leap and accept a position at 
a biotech company. To my sur- 
prise, I loved working there. In 
fact, my experience was so posi- 
tive that I changed plans. I did not want to do a Ph.D. any- 
more; I wanted a career in industry. I thought I had found 
my vocation, and that my path was clear. 

But my position was just a 1-year contract, and when I 
started to look for my next industry job, I hit a bureaucratic 
obstacle. I was searching for jobs across Europe, and in 
many countries my undergraduate degree was not consid- 
ered sufficient training for the positions I wanted. So much 
for that supposedly clear path. 

It looked like it was time for another pivot—back to 
where I had started, applying to Ph.D. programs. Again, I 
wondered whether moving back and forth between indus- 
try and academia might put me at a disadvantage with po- 
tential future employers. But my previous transition had 
taught me I should be open-minded and embrace opportu- 
nities as they come, even if they are not my top choice at the 
time. You never know how things are going to turn out. So, 
I tried not to be too anxious about changing course again. 

During one interview for a Ph.D. program, the selection 


“Professors told me that 
a move to industry would be 
a path of no return.” 


committee focused on how un- 
common it was for someone 
working in industry to want to 
come back to academia. I had 
played my cards right; I stood 
out! The program accepted me, 
and I enjoyed my Ph.D. experi- 
ence. I loved the science, had 
a great mentor, and lived in a 
fantastic city. Even though I had 
pictured a different course, I 
didn’t have any regrets. 

When I finished my Ph.D., 
I still wanted to return to in- 
dustry. I thought my previous 
industry experience coupled 
with my improved credentials 
would make the transition rela- 
tively easy, but that did not end 
up being the case. I wasn’t get- 
ting any job offers. I was disap- 
pointed, but I took some solace 
in remembering how well piv- 
oting had worked out for me in 
the past. I was now much more comfortable moving from 
plan A to plan B—or even plan C. 

So, I started to apply for postdoctoral positions. I got a 
great offer to start an independent line of research in a group 
with an outstanding working environment, and my time there 
has been great. In addition to my research, I’ve developed my 
entrepreneurial and leadership skills as co-founder and 
president of my Ph.D. program’s alumni association. Looking 
back, plan B feels like a plan A—just as it did so many times 
before in my back-and-forth career. And now I’m ready to 
embark on plan C: setting out as an entrepreneur. 

I accepted a job in industry when my mind was in aca- 
demia. I found a job in academia when all I wanted was a 
job in industry. Yet I feel happy with my career path, and I 
look forward to the pivots yet to come. & 


Pedro Resende is a research associate at i3S (the Institute 
for Research and Innovation in Health) in Porto, Portugal. 
Send your career story to SciCareerEditor @aaas.org. 
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